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To the Student 

These notes are provided for your benefit as an attempt to organise the salient points of the course. They 
are a very terse account of the main ideas of the course, and are to be used mostly to refer to central 
definitions and theorems. The number of examples is minimal, and here you will find few exercises. 
The motivation or informal ideas of looking at a certain topic, the ideas linking a topic with another, the 
worked-out examples, etc., are given in class. Hence these notes are not a substitute to lectures: you 
must always attend to lectures. The order of the notes may not necessarily be the order followed in 
the class. 

There is a certain algebraic fluency that is necessary for a course at this level. These algebraic 
prerequisites would be difficult to codify here, as they vaiy depending on class response and the topic 
lectured. If at any stage you stumble in Algebra, seek help! I am here to help you! 

Tutoring can sometimes help, but bear in mind that whoever tutors you may not be familiar with 
my conventions. Again, I am here to help! On the same vein, other books may help, but the approach 
presented here is at times unorthodox and finding alternative sources might be difficult. 

Here are more recommendations: 

• Read a section before class discussion, in particular, read the definitions. 

• Class provides the informal discussion, and you will profit from the comments of your classmates, 
as well as gain confidence by providing your insights and interpretations of a topic. Don’t be 
absent! 

• Once the lecture of a particular topic has been given, take a fresh look at the notes of the lecture 
topic. 

• Try to understand a single example well, rather than ill-digest multiple examples. 

• Start working on the distributed homework ahead of time. 

• Ask questions during the lecture. There are two main types of questions that you are likely to 
ask. 

1 . Questions of Correction: Is that a minus sign there? If you think that, for example, I have missed 
out a minus sign or wrote P where it should have been Q, 1 then by all means, ask. No one 
likes to carry an error till line XLV because the audience failed to point out an error on line I. 
Don’t wait till the end of the class to point out an error. Do it when there is still time to correct 
it! 

2. Questions of Understanding: I don’t get it! Admitting that you do not understand something is 
an act requiring utmost courage. But if you don’t, it is likely that many others in the audience 
also don’t. On the same vein, if you feel you can explain a point to an inquiring classmate, I 
will allow you time in the lecture to do so. The best way to ask a question is something like: 
“How did you get from the second step to the third step?” or “What does it mean to complete 
the square?” Asseverations like “I don’t understand” do not help me answer your queries. If 
I consider that you are asking the same questions too many times, it may be that you need 
extra help, in which case we will settle what to do outside the lecture. 

• Don’t fall behind! The sequence of topics is closely interrelated, with one topic leading to another. 

• The use of calculators is allowed, especially in the occasional lengthy calculations. However, when 
graphing, you will need to provide algebraic/ analyt ic /geometric support of your arguments. The 
questions on assignments and exams will be posed in such a way that it will be of no advantage to 
have a graphing calculator. 

• Presentation is critical. Clearly outline your ideas. When writing solutions, outline major steps and 
write in complete sentences. As a guide, you may try to emulate the style presented in the scant 
examples furnished in these notes. 


1 My doctoral adviser used to say “I said A, I wrote B, I meant C and it should have been D! 


Chapter 

Preliminaries 

1.1 Sets and Notation 

1 Definition We will mean by a set a collection of well defined members or elements. 

2 Definition The following sets have special symbols. 

N = {0, 1 , 2, 3, . . .} denotes the set of natural numbers. 

Z = {. . . , —3, —2, — 1 , 0, 1 , 2, 3, . . .} denotes the set of integers. 

Q denotes the set of rational numbers. 

R denotes the set of real numbers. 

C denotes the set of complex numbers. 

0 denotes the empty set. 


3 Definition (Implications) The symbol 


is read “implies”, and the symbol is read “if and only if.” 


4 Example Prove that between any two rational numbers there is always a rational number. 
Solution: ► Let (a, c) E Z 2 , (b, d) E (N \ {0}) 2 , § < -§■ Then da < be. Now 
ab + ad<ab + bc =$ a(b + d) < b(a + c) 


a a + c 
b < b + d’ 


whence the rational number 


da + dc < cb + cd =^> d(a + c) < c(b + d) 
a + c 


a + c c 

< 


b + d d’ 


lies between £ and L . ^ 
b + d b d 


We can also argue that the average of two distinct numbers lies between the numbers and 

Ti + r2 

so if r-i < V 2 are rational numbers, then — - — lies between them. 

5 Definition Let A be a set. If a belongs to the set A, then we write a E A, read “a is an element of A.” If 
a does not belong to the set A, we write a g? A, read “a is not an element of A.” 
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6 Definition (Conjunction, Disjunction, and Negation) The symbol V is read “or” (disjunction), the symbol 
A is read “and” (conjunction), and the symbol — ■ is read “not.” 

7 Definition (Quantifiers) The symbol V is read “for all” (the universal quantifier), and the symbol 3 is read 
“there exists” (the existential quantifier) . 

We have 


8 Definition (Subset) If Va £ A we have a 6 B. then we write A C B, which we read “A is a subset of B. 
In particular, notice that for any set A, 0 C A and AC A. Also 


D3T A = B V=V (ACB)A(BC A). 

9 Definition The union of two sets A and B, is the set 

A U B = (x : (x £ A) V (x £ B)}. 
This is read “A union B.” See figure 1.1. 

10 Definition The intersection of two sets A and B. is 


“■(Vx £ A, P(x)) ^ (3 £ A,— -P(x)) 
— ‘(3 £ A, P(x)) V=V (Vx £ A,— 'P(x)) 


( 1 . 1 ) 


( 1 . 2 ) 


NCZCQCKCC. 


A n B = (x : (x £ A) A (x £ B)}. 


This is read “A intersection B.” See figure 1.2. 


1 1 Definition The difference of two sets A and B, is 


A \ B ={x : (x £ A) A (x £ B)}. 


This is read “A set minus B.” See figure 1.3. 





A B 


A B 


A B 


Figure 1.1: A U B 


Figure 1.2: A n B 


Figure 1.3: A \ B 


12 Example Prove by means of set inclusion that 


(A u B) n C = (A n C) u (B n C). 
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Solution: ► We have, 


x e (AuB)nc 


x 6 (AUB)Axg C 
(xeAVxeB)Axec 
(x£AAx6C)V(x6BAxgC) 
(x 6 A n C) V (x <E B n C) 


x e (A n C) u (B n C), 

which, establishes the equality. ◄ 

13 Definition Let At , A 2 , . . . , A n , be sets. The Cartesian Product of these n sets is defined and denoted 
by 

At x A 2 x • • • x A n = {(a! , a 2 , . . . , a n ) : a k 6 A k }, 
that is, the set of all ordered n-tuples whose elements belong to the given sets. 


In the particular case when all the A k are equal to a set A, we write 

A-i x A 2 X • • • X A n = A n . 

If a 6 A and b 6 A we write ( a, b ) 6 A 2 . 

14 Definition Let xgl. The absolute value ofx — denoted by |x| — is defined by 


I 

—x if x < 0, 

M = < 



x if x > 0. 


1 


( 1 . 3 ) 

( 1 . 4 ) 

( 1 . 5 ) 


It follows from the definition that for xgl, 

— |x| < X < |x|. 

t > 0 =^> |x| < t — t < x < t. 
Va e R =4 V / a 2 = |a|. 


15 Theorem (Triangle Inequality) Let (a, b) e R 2 . Then 

|a + b| < |a| + |b| 


Proof: From 1.3, by addition, 


— |a| < a < |a| 


to 

— |b| < b < |b| 

we obtain 

— (|a| + |b|) < a + b < (|a| + |b|), 
whence the theorem follows by 1.4. □ 


( 1 . 6 ) 
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Homework 

Problem 1.1.1 Prove that between any two rational 
numbers there is an irrational number. 

Problem 1.1.2 Prove that X \ (X \ A) = X n A. 

Problem 1.1.3 Prove that 

X \ (AU B) = (X \ A) n (X\ B). 

Problem 1.1.4 Prove that 

X \ (An B) = (X\ A) U (X\B). 


Problem 1.1.5 Prove that 

(AU B) \ (A n B) = (A\ B) U (B \ A). 

Problem 1.1.6 Write the union A U B U C as a disjoint 
union of sets. 

Problem 1.1.7 Prove that a set with n > 0 elements has 
2 n subsets. 

Problem 1.1.8 Let (a, b) e R 2 . Prove that 
||a| - |b|| < [a — b|. 


1.2 Partitions and Equivalence Relations 

16 Definition Let S ^ 0 be a set. A partition of S is a collection of non-empty, pairwise disjoint subsets of 
S whose union is S. 


1 7 Example Let 


2Z = {. . . , —6, —4, —2, 0, 2, 4, 6, . . .} = 0 
be the set of even integers and let 

2Z + 1 ={...,-5, -3,-1, 1,3,5,...} = ! 

be the set of odd integers. Then 

(2Z) U (2Z + 1) = Z, (2Z) n (2Z + 1) = 0, 
and so {2Z, 2Z + 1 } is a partition of Z. 

18 Example Let 

3Z = {... — 9, , -6, -3, 0, 3, 6, 9, . . .} = 0 

be the integral multiples of 3, let 

3Z + 1 ={..., -8, -5, -2, 1,4, 7,...} = ! 
be the integers leaving remainder 1 upon division by 3, and let 

3Z + 2 = {. . . , —7, -4, -1 , 2, 5, 8, . . .} = 2 
be integers leaving remainder 2 upon division by 3. Then 

(3Z) U (3Z + 1 ) U (3Z + 2) = Z, 

(3Z) n (3Z + 1 ) = 0, (3Z) n (3Z + 2) = 0, (3Z + 1 ) n (3Z + 2) = 0, 
and so {3Z, 3Z + 1 , 3Z + 2} is a partition of Z. 


Notice that 0 and 1 do not mean the same in examples 1 7 and 18. Whenever we make use 
of this notation, the integral divisor must be made explicit. 

19 Example Observe 

R=(Q)U(R\Q), 0 = (Q) n (R\Q), 


which means that the real numbers can be partitioned into the rational and irrational numbers. 


Partitions and Equivalence Relations 
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20 Definition Let A, B be sets. A relation R is a subset of the Cartesian product A x B. We write the fact 
that (x, y ) E R as x ~ y . 

21 Definition Let A be a set and R be a relation on A x A. Then R is said to be 

• reflexive if (Vx E A) , x ~ x, 

• symmetric if (V(x, y) E A 2 ),x ~ y =A y ~ x, 

• anti-symmetric if (V(x, y) 6 A 2 ), (x ~ y) A (y ~ x) =A x = y, 

• transitive if (V(x,y,z) E A 3 ), (x ~ y) A (y ~ z) =4 (x~z). 

A relation R which is reflexive, symmetric and transitive is called an equivalence relation on A. A relation 
R which is reflexive, anti- symmetric and transitive is called a partial order on A. 

22 Example Let S ={A11 Human Beings}, and define ~ on S as a ~ b if and only if a and b have the same 
mother. Then a - a since any human a has the same mother as himself. Similarly, a ~ b =A b - a 
and (a ~ b) A (b ~ c) =A (a - c). Therefore ~ is an equivalence relation. 

23 Example Let L be the set of all lines on the plane and write h ~ I 2 if h III 2 (the line h is parallel to the 
line I 2 ). Then ~ is an equivalence relation on L. 

24 Example In Q define the relation § ~ ^ AA ay = bx, where we will always assume that the 
denominators are non-zero. Then ~ is an equivalence relation. For ^ ~ A since ab = ab. Clearly 

ax x a 

— =A ay = bx =A xb = y a =A — . 

by y b 

Finally, if § ~ ^ and ^ ~ | then we have ay = bx and xt = sy. Multiplying these two equalities 
ayxt = bxsy. This gives 

ayxt — bxsy = 0 =A xy(at — bs) = 0. 

Now if x = 0, we will have a = s = 0, in which case trivially at = bs. Otherwise we must have at — bs = 0 
and so - | . 

b t 

25 Example Let X be a collection of sets. Write A - B if A C B. Then ~ is a partial order on X. 

26 Example For (a, b) gl 2 define 

a ~ b A a 2 + b 2 > 2. 

Determine, with proof, whether ~ is reflexive, symmetric, and/or transitive. Is ~ an equivalence relation? 
Solution: ► Since 0 2 + 0 2 f 2, we have 0^0 and so ~ is not reflexive. Now, 

a ~ b A a 2 + b 2 
A b 2 + a 2 
A b - a, 

so ~ is symmetric. Also 0 ~ 3 since 0 2 + 3 2 > 2 and 3 ~ 1 since 3 2 + l 2 > 2. But 0^1 since 
0 2 + l 2 f 2. Thus the relation is not transitive. The relation, therefore, is not an equivalence 
relation. 
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27 Definition Let ~ be an equivalence relation on a set S. Then the equivalence class of a is defined and 
denoted by 

[a] = {x 6 S : x ~ a}. 


28 Lemma Let ~ be an equivalence relation on a set S. Then two equivalence classes are either identical 
or disjoint. 

Proof: We prove that if (a, h) E S 2 , and [a] n [b] f 0 then [a] = [b]. Suppose thatx E [a] n [b]. 

Nowx E [a] x - a =£> a ~ x, by symmetry. Similarly, x E [b] ==> x ~ b. By transitivity 

(a~x)A(x~b) =^> a - b. 

Now, if y E [b] then b ~ y . Again by transitivity, a ~ y. This means that y 6 [a]. We have shewn 
that y E [b] =/ y E [a] and so [b] C [a]. In a similar fashion, we may prove that [a] C [b]. This 
establishes the result. □ 

29 Theorem Let S f 0 be a set. Any equivalence relation on S induces a partition of S. Conversely, 
given a partition of S into disjoint, non-empty subsets, we can define an equivalence relation on S whose 
equivalence classes are precisely these subsets. 

Proof: By Lemma 28, if - is an equivalence relation on S then 

S = (J [a], 

aES 

and [a] n [b] = 0 if a b. This proves the first half of the theorem. 

Conversely, let 

S = U S«, Sa n Sp = 0 if a yp, 

(X 

be a partition of S. We define the relation « on S by letting a « b if and only if they belong to 
the same S a . Since the S a are mutually disjoint, it is clear that « is an equivalence relation on 
S and that for a E S a , we have [a] = S«. □ 


Homework 


Problem 1.2.1 For (a,b) E 
~ as follows: a - b O ■§■ E 
relation is reflexive, symmetric, and/or transitive. 


(Q\{0)) 2 define the relation 
Z. Determine whether this 


Problem 1.2.2 Give an example of a relation on Z \ {0} 
which is reflexive, but is neither symmetric nor transi- 
tive. 


Problem 1.2.3 Define the relation ~ in R by x ~ y 
is an equivalence relation. 


xe u = ye x . Prove that 


Problem 1.2.4 Define the relation 
3h E Z such that x = ^ ^ 


in Q by x - y 
[A] Prove that ~ is an 


equivalence relation. [B] Determine [x], the equivalence 
of x E Q. [C] Is | 


1.3 Binary Operations 

30 Definition Let S, T be sets. A binary operation is a function 


SxS ^ T 

(a, b) i-> (a, b) 


Binary Operations 
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We usually use the “infix” notation a 0 b rather than the “prefix” notation ®(a, b). If S = T then we say 
that the binary operation is internal or closed and if S =/ T then we say that it is external. If 

a ® b = b 0 a 

then we say that the operation ® is commutative and if 

a0(b0c) = (a0b)0c, 
we say that it is associative. If 0 is associative, then we can write 

a®(b®c) = (a0b)0c=a0b0c, 


without ambiguity. 


We usually omit the sign 0 and use juxtaposition to indicate the operation 0. Thus we write 
ab instead of a 0 b . 

31 Example The operation + (ordinary addition) on the set Z x Z is a commutative and associative closed 
binary operation. 

32 Example The operation — (ordinary subtraction) on the set N X N is a non-commutative, non- 
associative non-closed binary operation. 

33 Example The operation 0 defined by a 0 b = 1 + ab on the set Z x Z is a commutative but non- 
associative internal binary operation. For 

a0b = 1 +ab = l +ba = ba, 

proving commutativity. Also, 1 0 (2 0 3) = 1 0 (7) = 8 and (1 0 2) 0 3 = (3) 0 3 = 10, evincing 
non-associativity. 

34 Definition Let S be a set and 0:SxS— >Sbea closed binary operation. The couple (S, 0) is called 
an algebra. 

When we desire to drop the sign 0 and indicate the binary operation by juxtaposition, we 
simply speak of the “algebra S.” 

35 Example Both (Z, +) and (Q, •) are algebras. Here + is the standard addition of real numbers and • 
is the standard multiplication. 

36 Example (Z, — ) is a non-commutative, non-associative algebra. Here — is the standard subtraction 
operation on the real numbers 

37 Example (Putnam Exam, 1972) Let S be a set and let * be 

V(x,y) 6 S 2 

x * (x * y) = y, 

(y * x) * x = y . 

Shew that * is commutative, but not necessarily associative. 

Solution: ► By (1.8) 

x * y = ((x * y) * x) * x. 

By (1.8) again 

((x * y) * x) * x = ((x * y) * ((x * y) * y)) * x. 


a binary operation of S satisfying the laws 

(1.7) 

( 1 . 8 ) 
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By (1.7) 


((x*y)* ((x*y)*y))*x = (y) * x = y * x, 


which is what we wanted to prove. 


To shew that the operation is not necessarily associative, specialise S = Z and x * y = — x — y 
(the opposite ofx minus y). Then clearly in this case * is commutative, and satisfies (1.7) and 
(1.8) but 

0 * ( 0 * 1 ) = 0 * (- 0 - 1 ) = 0 * (- 1 ) =-0 - (- 1 ) = 1 , 

and 

(0 * 0 ) * 1 = (-0 - 0 ) * 1 = ( 0 ) * 1 = -0 - 1 = - 1 , 

evincing that the operation is not associative. ◄ 

38 Definition Let S be an algebra. Then l E S is called a left identity if Vs € S we have Is = s. Similarly 
r E S is called a right identity if Vs E S we have sr = s. 


39 Theorem If an algebra S possesses a left identity l and a right identity r then l = r. 


Proof: Since l is a left identity 

Since r is a right identity 
Combining these two, we gather 
whence the theorem follows. □ 


r = Ir. 

I = Ir. 
r = Ir = l, 


40 Example In (Z, +) the element 0 E Z acts as an identity, and in (Q, •) the element 1 6 Q acts as an 
identity. 

41 Definition Let S be an algebra. An element a E S is said to be left-cancellable or left- regular if V(x, y) E 
S 2 

ax = ay =$ x = y . 

Similarly, element b E S is said to be right-cancellable or right-regular if V(x, y ) E S 2 

xb = yb =$ x = y . 

Finally, we say an element c E S is cancellable or regular if it is both left and right cancellable. 


42 Definition Let (S, 0) and (S, T) be algebras. We say that T is left-distributive with respect to 0 if 

V(x,y,z) e S 3 , xT (y 0 z) = (xTy) 0 (xTz). 

Similarly, we say that T is right-distributive with respect to 0 if 

V(x,y,z) E S 3 , (y 0 z)Tx = (yTx) 0 (zTx). 

We say that T is distributive with respect to 0 if it is both left and right distributive with respect to 0. 
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n 


Problem 1.3.1 Let 

S = {x G Z : 3(a, b) G Z 2 ,x = a 3 + b 3 + c 3 — 3abc}. 

Prove that S is closed under multiplication, that is, if 
x G S and y G S then xy G S. 

Problem 1.3.2 Let (S,<g>) be an associative algebra, let 
a G S be a fixed element and define the closed binary 
operation T by 

xT y = x (g) a®y. 

Prove that T is also associative over S x S. 

Problem 1.3.3 On Qn] — 1 ; 1 [ define the a binary opera- 
tion ® 


where juxtaposition means ordinary multiplication and 
+ is the ordinary addition of real numbers. Prove that 

O Prove that ® is a closed binary operation on 
Qn] — 1; l[. 

© Prove that ® is both commutative and associative. 

© Find an element e G R such that (Va G Qn] — 
1;1[) (e <g> a = a). 

© Given e as above and an arbitrary element a G 
Qn] — 1 ; 1 [, solve the equation a® b = e for b. 


Problem 1.3.4 On R \ {1} define the a binary operation 

<£> 

a®b = a + b — ab, 

where juxtaposition means ordinary multiplication and 
+ is the ordinary addition of real numbers. Clearly ® is 
a closed binary operation. Prove that 

O Prove that ® is both commutative and associative. 

© Find an element e G R \ {1} such that (Va G 

IR. \ {1 » (e ® a = a). 

© Given e as above and an arbitrary element a G 
R \ {1 }, solve the equation a ® b = e for b. 


Problem 1.3.5 (Putnam Exam, 1971) Let S be a set 

and let o be a binary operation on S satisfying the two 
laws 

(Vx G S) (x o x = x), 

and 

(V(x, y , z) G S 3 )((xoy)oz=(yoz)ox). 

Shew that o is commutative. 

Problem 1.3.6 Define the symmetric difference of the 
sets A,B as AAB = (A \ B) U (B \ A). Prove that A 
is commutative and associative. 


1.4 Z n 

43 Theorem (Division Algorithm) Let n > 0 be an integer. Then for any integer a there exist unique integers 
q (called the quotient) and r (called the remainder) such that a = qn + r and 0 < r < q. 


Proof: In the proof of this theorem, we use the following property of the integers, called the 

well-ordering principle: any non-empty set of non-negative integers has a smallest element. 


Consider the set 


S = {a — bn : b G Z A a > bn}. 

Then S is a collection of nonnegative integers and S f 0 as ±a — 0 • n E S and this is non- 
negative for one choice of sign. By the Well-Ordering Principle, S has a least element, say r. 
Now, there must be some q 6 Z such that r = a — qn since r 6 S. By construction, r > 0. Let 
us prove that r < n. For assume that r > n. Then r>r-n=a-qn-n = a- (q + l)n>0, 
since r — n > 0. But then a-(q + l)ng S and a— (q + l)n<r which contradicts the fact that 
r is the smallest member of S. Thus we must have 0 < r < n. To prove that r and q are unique, 
assume that qrn + r! = a = q2n + r2, 0 < r! < n, 0 < r2 < n. Then r 2 — r^ = n(qi — q2), that 
is, n divides (r2 — r-| ). But \r 2 — r-j | < n, whence r2 = ri . From this it also follows that qi = q2- 
This completes the proof. □ 


44 Example If n = 5 the Division Algorithm says that we can arrange all the integers in five columns as 
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follows: 


-10 -9 -8 -7 -6 

-5 -4 -3 -2 -1 

0 12 3 4 

5 6 7 8 9 

The arrangement above shews that any integer comes in one of 5 flavours: those leaving remainder 0 
upon division by 5, those leaving remainder 1 upon division by 5, etc. We let 

5Z = —15, —10, —5, 0,5,10,15,...} = 0, 

5Z + 1 = {. . . , -14, -9, -4, 1 , 6, 11 , 1 6, . . .} = T, 

5Z + 2 = {..., -13, -8, -3, 2, 7, 12, 17, . . .} = 2, 

5Z + 3 = {..., -12, -7, -2, 3,8,13,18,...} = 3, 

5Z + 4 = {. . . , -1 1 , -6, -1 , 4, 9, 14, 1 9, . . .} = 4, 

and 

Z 5 = {0, 1,2,3, 4}. 

Let n be a fixed positive integer. Define the relation = by x = y if and only if they leave the same 
remainder upon division by n. Then clearly = is an equivalence relation. As such it partitions the set of 
integers Z into disjoint equivalence classes by Theorem 29. This motivates the following definition. 

45 Definition Let n be a positive integer. The n residue classes upon division by n are 

0 = nZ, 1 = nZ + 1 , 2 = nZ + 2, . . . , n — 1 = nZ + n — 1 . 

The set of residue classes modulo n is 

Z n = {0,T, . . . , n — 1 }. 


Our interest is now to define some sort of “addition” and some sort of “multiplication” in Z n . 

46 Theorem (Addition and Multiplication Modulo n) Let n be a positive integer. For (a, b) E (Z n ) 2 define 
a + b = r, where r is the remainder of a + b upon division by n. and a ■ b = t, where t is the remainder 
of ab upon division by n. Then these operations are well defined. 

Proof: We need to prove that given arbitrary representatives of the residue classes, we always 

obtain the same result from our operations. That is, if a = a' and b = b' then we have a + b = 
a' + b' and a • b = a' • b'. 


a = a' 3(q, q') E Z 2 , r E N, a = qn + r, a' = q'n + r, 0 < r < n, 

b = b / 3(qi , q{ ) € Z 2 ,ri E N, b = qin + r -\ , b' = qjn + ri , 0 < ri < n. 


Now 


11 


z 


n 


Hence 

a + b = (q + qi)n + r + r 1t a' + b' = (q' + q 1 , )n + r + n , 
meaning that both a + b and a' + b ' leave the same remainder upon division by n, and therefore 

a + b = a + b = a / + b , = a 7 + b / . 


Similarly 

ab = (qq!n + c^r^ + rq! )n + rri , a'b' = (q'qjn + q'^ + rq^n + rr -[ , 
and so both ab and a'b' leave the same remainder upon division by n, and therefore 

a • b = ab = a'b' = a' • b'. 

This proves the theorem. □ 


47 Example Let 

Z 6 ={0,T,2,3,4,5} 

be the residue classes modulo 6. Construct the natural addition + table for Also, construct the 
natural multiplication • table for Z 6 . 


Solution: ► The required tables are given in tables 1.1 and 1.2. ◄ 


• 

0 

1 

2 

3 

4 

5 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

2 

3 

4 

5 

2 

0 

2 

4 

0 

2 

4 

3 

0 

3 

0 

3 

0 

3 

4 

0 

4 

2 

0 

4 

2 

5 

0 

5 

4 

3 

2 

1 


+ 

0 

1 

2 

3 

4 

5 

0 

0 

1 

2 

3 

4 

5 

1 

1 

2 

3 

4 

5 

0 

2 

2 

3 

4 

5 

0 

1 

3 

3 

4 

5 

0 

1 

2 

4 

4 

5 

0 

1 

2 

3 

5 

5 

0 

1 

2 

3 

4 


Table 1.1: Addition table for Zg . 


Table 1.2: Multiplication table for Z&. 


We notice that even though 2 f 0 and 3 f 0 we have 2 • 3 = 0 in Z (v This prompts the following 
definition. 

48 Definition (Zero Divisor) An element a f 0 of Z n is called a zero divisor if ab = 0 for some b 6 Z n . 

We will extend the concept of zero divisor later on to various algebras. 

49 Example Let 

Z 7 = {0j,2,3,4,5,6j 

be the residue classes modulo 7. Construct the natural addition + table for Z 7 . Also, construct the 
natural multiplication • table for Z 7 
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• 

0 

1 

2 

3 

4 

5 

6 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

2 

3 

4 

5 

6 

2 

0 

2 

4 

6 

1 

3 

5 

3 

0 

3 

6 

2 

5 

1 

4 

4 

0 

4 

1 

5 

2 

6 

3 

5 

0 

5 

3 

1 

6 

4 

2 

6 

0 

6 

5 

4 

3 

2 

1 


+ 

0 

1 

2 

3 

4 

5 

6 

0 

0 

1 

2 

3 

4 

5 

6 

1 

1 

2 

3 

4 

5 

6 

0 

2 

2 

3 

4 

5 

6 

0 

1 

3 

3 

4 

5 

6 

0 

1 

2 

4 

4 

5 

6 

0 

1 

2 

3 

5 

5 

6 

0 

1 

2 

3 

4 

6 

6 

0 

1 

2 

3 

4 

5 


Table 1.3: Addition table for Z 7 . 


Table 1.4: Multiplication table for Z 7 . 


Solution: ► The required tables are given in tables 1.3 and 1.4. ◄ 


50 Example Solve the equation 


in Zi 1 . 


5x = 3 


Solution: ► Multiplying by 9 on both sides 

45x = 27, 


that is, 

x = 5. 

◄ 

We will use the following result in the next section. 

51 Definition Let a, b be integers with one of them different from 0. The greatest common divisor d of 
a, b, denoted by d = gcd(a, b) is the largest positive integer that divides both a and b. 

52 Theorem (Bachet-Bezout Theorem) The greatest common divisor of any two integers a, b can be writ- 
ten as a linear combination of a and b, i.e., there are integers x, y with 

gcd (a, b) = ax + by. 


Proof: Let A = {ax + by : ax + by > 0, x, y E Z}. Clearly one of ± a, ±b is in A, as one of a, b is 

not zero. By the Well Ordering Principle, A has a smallest element, say d. Therefore, there are 
x 0 ,yo such that d = ax 0 + byo- We prove that d = gcd(a, b). To do this we prove that d divides 
a and b and that if t divides a and b , then t must also divide then d. 

We first prove that d divides a. By the Division Algorithm, we can find integers q,r,0 < r < d 
such that a = dq + r. Then 


r = a — dq = a(1 — qx 0 ) — by 0 . 


Fields 
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If r > 0, then r E A is smaller than the smaller element of A, namely d, a contradiction. Thus 
r = 0. This entails dq = a, i.e. d divides a. We can similarly prove that d divides b. 

Assume that t divides a and b. Then a = tm, b = t nfor integers m, n. Hence d = axo + bx 0 = 
t(mx 0 + nyo), that is, t divides d. The theorem is thus proved. □ 


Homework 


Problem 1.4.1 Write the addition and multiplication ta- 
bles of Zi i under natural addition and multiplication 
modulo 1 1 . 

Problem 1.4.2 Solve the equation 3x 2 — 5x + 1 = 0 in 

Zi i . 

Problem 1.4.3 Solve the equation 
5x 2 =3 


inZn- 


Problem 1.4.4 Prove that if n > 0 is a composite integer, 
Z n has zero divisors. 


Problem 1.4.5 How many solutions does the equation 
x 4 + x 3 + x 2 + x + T = 0 have in Zi i ? 


1.5 Fields 

53 Definition Let F be a set having at least two elements Of and 1 F (Of f If) together with two operations 
• (multiplication, which we usually represent via juxtaposition) and + (addition). Afield (F,*,+) is a 
triplet satisfying the following axioms V(a, b, c) E F 3 : 

FI Addition and multiplication are associative: 

(a + b) + c = a + (b + c), (ab)c = a(bc) (1.9) 


F2 Addition and multiplication are commutative: 

a + b = b + a, ab = ba 

F3 The multiplicative operation distributes over addition: 

a(b + c) = ab + ac 


F4 Op is the additive identity: 

F5 1 f is the multiplicative identity: 


0F + a= a + 0F = a 


lpCl = 0.1 F = O 


F6 Every element has an additive inverse: 

3 — a E F, a + (—a) = (—a) + a = 0 F 


F7 Every non-zero element has a multiplicative inverse: if a f Of 

3a _1 E F, aa 1 = a -1 a = 1 F 


( 1 . 10 ) 

( 1 . 11 ) 

( 1 . 12 ) 

(1.13) 

(1.14) 

(1.15) 


The elements of a field are called scalars. 


An important property of fields is the following. 


54 Theorem A field does not have zero divisors. 
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Proof: Assume that ub = 0 .-. If a =f 0 - then it has a multiplicative inverse a -1 . We deduce 

a~ 1 ab = a~ 1 0F =^> b = Of. 

This means that the only way of obtaining a zero product is if one of the factors is Of- □ 

55 Example (<Q>, •,+), (R, •,+), and (C, •,+) are all fields. The multiplicative identity in each case is 1 
and the additive identity is 0. 


56 Example Let 

and define addition on this set as 


Qtv^) ={a + V2b : (a,b) e Q 2 } 


(a + V2b) + (c + V2d) = (a + c) + V2(b + d), 


and multiplication as 

(a + Vlb){c + Vld) = (ac + 2bd) + V2(a d + be). 

Then (Q + V2Q, , +) is a field. Observe Of = 0, 1 F = 1 , that the additive inverse of a + \/2b is —a — Vlb, 
and the multiplicative inverse of a + Vlb, (a, b) f (0,0) is 


(a + VIb)- 1 


1 a — Vlb a Vlb 

a + Vlb a 2 — 2b 2 a 2 — 2b 2 a 2 — 2b 2 


Here a 2 — 2b 2 f 0 since y/2 is irrational. 


57 Theorem If p is a prime, (Z p , •, +) is a field under • multiplication modulo p and + addition modulo 
P- 

Proof: Clearly the additive identity is 0 and the multiplicative identity is 1 . The additive 

inverse of a is p a. We must prove that every agZ p \ {0} has a multiplicative inverse. Such 
an a satisfies gcd( a, p) = 1 and by the Bachet-Bezout Theorem 52, there exist integers x, y with 
px + ay = 1 . In such case we have 


1 = px + ay = ay = a • y, 


whence (a) 1 = y. □ 

58 Definition A field is said to be of characteristic p y 0 if for some positive integer p we have Va 6 F, pa = 
Of, and no positive integer smaller than p enjoys this property. 


If the field does not have characteristic p =f 0 then we say that it is of characteristic 0. Clearly Q, R and 
C are of characteristic 0, while Z p for prime p, is of characteristic p. 

59 Theorem The characteristic of a field is either 0 or a prime. 


Proof: If the characteristic of the field is 0, there is nothing to prove. Let p be the least positive 

integer for which V u 6 F, pa = Of- Let us prove that p must be a prime. Assume that instead we 
had p = st with integers s > 1 , t > 1 . Take a - 1 p . Then we must have (st)1p = Of, which entails 
(si f) (tip) = Op. But in afield there are no zero-divisors by Theorem 54, hence either si f = Oh or 
tip = Of- But either of these equalities contradicts the minimality of p. Hence p is a prime. □ 


Functions 
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Homework 

Problem 1.5.1 Consider the set of numbers 

®{V2, VI, V6) = (a + b\/2 + ca/3 + dV& : (a, b, c, d) e Q 4 }. 

Assume that Q(\/2, \/3, v / 6 ) is a field under ordinary addition and multiplication. What is the multiplicative inverse 
of the element V~2 + 2V3 + 

Problem 1.5.2 Let F be a field and a, b two non-zero elements of F. Prove that 

— (ab _1 ) = (-a)b- 1 = a(-b _1 ). 

Problem 1.5.3 Let F be a field and a f Of- Prove that 

(-a )- 1 =-(a- 1 ). 

Problem 1.5.4 Let F be a field and a, b two non-zero elements of F. Prove that 

ab - 1 = ( a)(— b 1 ). 


1.6 Functions 

60 Definition By a function or a mapping from one set to another, we mean a rule or mechanism that 
assigns to every input element of the first set a unique output element of the second set. We shall call 
the set of inputs the domain of the function, the set of possible outputs the target set of the function, 
and the set of actual outputs the image of the function. 

We will generally refer to a function with the following notation: 


D -» T 

f : 


x i — ) f(x) 


Here f is the name of the function, D is its domain, T is its target set, x is the name of a typical input 
and f (x) is the output or image ofx under f. We call the assignment x i-4 f(x) the assignment rule of the 
function. Sometimes x is also called the independent variable. The set f(D) = {f(a)|a E D} is called the 
image of f. Observe that f (D) C T. 



Figure 1.4: An injection. 


Figure 1.5: Not an injection 


X — > Y 

61 Definition A function f : is said to be injective or one-to-one if V(a, b) E X 2 , we have 

X 1-4 f(x) 


afb =^> f(a) f f(b). 
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This is equivalent to saying that 


f(a) =f(b) =4 a = b. 


62 Example The function a in the diagram 1.4 is an injective function. The function |3 represented by 
the diagram 1.5, however, is not injective, (3(3) = (3(1 ) = 4, but 3 =/! . 


63 Example Prove that 


is an injection. 


M\{1} 

t : 

x 


-> K\{1} 

x + 1 

(-4 

X — 1 


Solution: ► Assumet(a) = t(b). Then 


t(a) = t(b) 


We have proved that t(a) = t(b) 


=4 


=4 


=4 a 


a + 1 b + 1 

a — 1 b — 1 

(a + 1 ) (b — 1 ) = (b + 1 )(a — 1 ) 

ab — a + b — 1 = ab — b + a — 1 

2a = 2b 

a = b 

= b , which shews that t is injective. ◄ 



Figure 1.6: A surjection 



64 Definition A function f : A — > B is said to be surjective or onto if (Vb 6 B) (3a 6 A) : f(a) = b. That is, 
each element of B has a pre-image in A. 


A function is surjective if its image coincides with its target set. It is easg to see that a 
graphical criterion for a function to be surjective is that every horizontal line passing through a 
point of the target set (a subset of the y -axis) of the function must also meet the curve. 

65 Example The function |3 represented by diagram 1.6 is surjective. The function y represented by 
diagram 1.7 is not surjective as 8 does not have a preimage. 


1-4 


66 Example Prove that t : 


x 


is a surjection. 


Functions 
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Solution: ► Since the graph oftis that of a cubic polynomial with only one zero , every horizontal 
line passing through a point in R will eventually meet the graph of g , whence t is surjective. To 
prove this analytically, proceed as follows. We must prove that (V b E R) (3a) such that 
t(a) = b. We choose a so that a = b 1/3 . Then 

t(a) =t(b 1/3 ) = (b 1/3 ) 3 = b. 

Our choice of a works and hence the function is surjective. ◄ 

67 Definition A function is bijective if it is both injective and surjective. 


Homework 


Problem 1.6.1 Prove that 


is an injection. 

Problem 1.6.2 Shew that 


is a bijection. 


h : 


R 


X 



X 


R\{3} 

6x 

2x — 3 


Chapter 


Matrices and Matrix Operations 


2. 1 The Algebra of Matrices 

68 Definition Let (F, • , +) be a field. An m x n (m by n) matrix A with m rows and n columns with entries 
over F is a rectangular array of the form 


2 




an 

an • 

* 0-1 n 


A = 

a 2 i 

a 22 • 

* &2n 



&m1 CL m 2 * 

* Cl mn 

where 6 {1,2, . 

. ,m} x (1,2,. . . ,n), 

atj e f. 



As a shortcut, we often use the notation A = [ay] to denote the matrix A with entries ay . 
Notice that when we refer to the matrix we put parentheses — as in lay],” and when we refer to 
a specific entry we do not use the surrounding parentheses — as in “atj.” 


69 Example 


is a 2 x 3 matrix and 



1 

3 


-2 1 


B = 1 


2 


0 3 


is a 3 x 2 matrix. 


70 Example Write out explicitly the 4x4 matrix A = [ap] where a,j = i 2 — j 2 . 
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Solution: ► This is 


I 2 -! 1 I 2 — 2 2 l 2 — 3 2 l 2 — 4 2 


0 -3 -8 -15 

2 2 — I 2 2 2 — 2 2 2 2 — 3 2 2 2 — 4 2 


3 0-5 -12 

3 2 — I 2 3 2 — 2 2 3 2 — 3 2 3 2 — 4 2 


8 5 0 -7 

4 2 - l 2 4 2 — 2 2 4 2 — 3 2 4 2 — 4 2 


15 12 7 0 


71 Definition Let (F, •, +) be a field. We denote by M mxn (F) the set of all m x n matrices with entries 
over F. M nxn (F) is, in particular, the set of all square matrices of size n with entries over F. 


72 Definition The m x n zero matrix 0 mxn E M mxn (F) is the matrix with Of’s everywhere. 


Of Of Of 
Of Of Of 
Of Of Of 

Of Of Of 


Of 

Of 

Of 


When m = n we write O n as a shortcut for 0, 


73 Definition The n x n identity matrix I n 6 M nxn (F) is the matrix with l F ’s on the main diagonal and 
Op’s everywhere else. 


If Of Of 
Of If Of 
Of Of If 

Of Of Of 


Of 

Of 

Of 


74 Definition (Matrix Addition and Multiplication of a Matrix by a Scalar) Let A = [ay] e M mxn (F), B = 
[by] E M mxn (F) and a E F. The matrix A + aB is the matrix C E M mxn (F) with entries C = [cy] where 
Ci j = ay T ab y . 
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75 Example For A = 


1 

1 


-1 

1 

-1 

1 

and B = 

2 

1 

0 

2 


0 

-1 


we have 


A + 2B 


-1 3 
3 3 
0 0. 


76 Theorem Let (A, B, C) <E (M mxn (F)) 3 and [a, |3) <E F 2 . Then 
Ml M m x n ( F ) is close under matrix addition and scalar multiplication 


A + B 6 M mxn (F), aA 6 M mxn (F) 


( 2 . 1 ) 


M2 Addition of matrices is commutative 


M3 Addition of matrices is associative 


A + B = B + A 


A + (B + C) = (A + B) +C 


( 2 . 2 ) 


(2.3) 


M4 There is a matrix 0 mxn such that 
M5 There is a matrix —A such that 


A + 0 


mxn 


(2.4) 



A + (—A) = (—A) + A = 0 mX n 

(2.5) 

M6 Distributive law 




a(A + B) = aA + aB 

(2.6) 

M7 Distributive law 




( a H - |3 ) A = a A -(- |3 B 

(2.7) 

M8 




1 F A = A 

(2.8) 

M9 




a(|3A) = (a|3)A 

(2.9) 


Proof: The theorem follows at once by reducing each statement to an entry-wise and appealing 

to the field axioms. □ 


Homework 


Problem 2.1.1 Write out explicitly the 3x3 matrix A = [hi;] where ciu = V . 


Problem 2.1.2 Write out explicitly the 3x3 matrix A = [aij] where atj = ij. 
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Problem 2.1.3 Let 


a 

-2a 

c 


1 

2a 

c 

0 

—a 

b 

, N = 

a 

b - a 

-b 

a + b 

0 

-1 


a-b 

0 

-1 


be square matrices with entries over K. Find M + N and 2M. 


Problem 2.1.4 Determine x and y such that 


3 x 1 


2 1 3 


7 3 7 


+ 2 


= 


1 2 0 


5x4 


11 V 8 


Problem 2.1.5 Determine 2x2 matrices A and B such that 


1 -2 


4 2 


, -2A + 6B = 


0 1 


6 0 


Problem 2.1.6 Let A = [ay] £ M nxn (R). Prove that 


min max ay > max min ay . 

! i i i 


Problem 2.1.7 A person goes along the rows of a movie theater and asks the tallest person of each row to stand 
up. Then he selects the shortest of these people, who we will call the shortest giant. Another person goes along the 
rows and asks the shortest person to stand up and from these he selects the tallest, which we will call the tallest 
midget Who is taller, the tallest midget or the shortest giant? 


Problem 2.1.8 (Putnam Exam, 1959) Choose five elements from the matrix 


11 

17 

25 

19 

16 

24 

10 

13 

15 

3 

12 

5 

14 

2 

18 

23 

4 

1 

8 

22 

6 

20 

7 

21 

9 


no two coming from the same row or column, so that the minimum of these five elements is as large as possible. 
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2.2 Matrix Multiplication 


77 Definition Let A = [atj] 6 M mxn (F) and B = [btj] 6 M nxp (F). Then the matrix product AB is defined 
as the matrix C = [c-tj] E M mxp (F) with entries Cij = dubij: 


an 

a 12 • • 

• av 

a 2 i 

a 22 • • 

• a 2 


aii a i2 


1 a m 2 


bn 

• bij • 

• • b v 

b 21 

• • b 2j • 

• • b 2 


hnl 




7 HP 


Cll 

C21 


Cl p 
C2p 


Cml 


-'Trip 


Observe that we use juxtaposition rather than a special symbol to denote matrix multipli- 
cation. This will simplify notation.In order to obtain the ij -th entry of the matrix AB we multiply 
elementwise the i -th row of A by the j -th column of B. Observe that AB isamxp matrix. 


78 Example Let M 


1 2 


5 6 


and N = 


3 4 


7 8 


be matrices over R. Then 


MN = 


and 


NM = 


1 2 


5 6 


1 -5 + 2-7 1 -6 + 2-8 


19 22 

3 4 


7 8 


3 • 5 + 4 • 7 3 • 6 + 4 • 8 


43 50 

5 6 


1 2 


5-1 +6-3 5 • 2 + 6 • 4 


23 34 

7 8 


3 4 


7-1 +8-3 7 • 2 + 8 • 4 


31 46 


Hence, in particular, matrix multiplication is not necessarily commutative. 

79 Example We have 


r 




1 

1 

1 


1 

1 

1 


1 

1 

1 



2 

-1 

-1 


0 

0 

0 

-1 

2 

-1 

= 

0 

0 

0 

-1 

-1 

2 


0 

0 

0 


over R. Observe then that the product of two non-zero matrices may be the zero matrix. 
80 Example Consider the matrix 


A = 


2 1 3 
0 T T 
4 4 0 


Matrix Multiplication 
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with entries over Z 5 . Then 



2 1 3 


2 1 3 

A 2 = 

0 1 1 


0 1 1 


4 4 0 


4 4 0 


1 0 2 


4 0 


1 ■ 


3 3 1 


Even though matrix multiplication is not necessarily commutative, it is associative. 

81 Theorem If (A, B,C) € M mxn (F) x M nxr (F) x M rxs (F) we have 

(AB)C = A(BC), 

i.e., matrix multiplication is associative. 

Proof: To shew this we only need to consider the ij-th entry of each side, appeal to the 

associativity of the underlying field F and verify that both sides are indeed equal to 

n r 

II Clikbkk'Cic'j. 

k=1 k'=1 


□ 

By virtue of associativity, a square matrix commutes with its powers, that is, if A 6 
M nxn (F), and (r, s) € N 2 , then (A r )(A s ) = (A s )(A r ) = A r+S . 

82 Example Let A 6 M 3 X 3 (K) be given by 


A = 


1 1 1 
1 1 1 
1 1 1 


Demonstrate, using induction, that A n = 3 n 1 A for n £ N, n > 1 . 


Solution: ► The assertion is trivial for n = 1 . Assume its truth for n — 1 , that is, assume 
A n 1 = 3 n 2 A. Observe that 


A 2 = 


1 1 1 


1 1 1 


3 3 3 

1 1 1 


1 1 1 

= 

3 3 3 

1 1 1 


1 1 1 


3 3 3 


= 3A. 


Now 

A n = AA n_1 = A(3 n ~ 2 A) = 3 n ~ 2 A 2 = 3 n ~ 2 3A = 3 n_1 A, 
and so the assertion is proved by induction. ◄ 
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83 Theorem Let A E M nxn (F). Then there is a unique identity matrix. That is, if E E M nxn (F) is such 
that AE = EA = A, then E = I n . 

Proof: It is clear that for any A E M nxn (F), AI n = I n A = A. Now because E is an identity, 

EI n = I n . Because I n is an identity, EI n = E. Whence 

I n = EI n = E, 


demonstrating uniqueness. □ 

84 Example Let A = [dq] E M nxn (R) be such that ciq = 0 for i > ) and aq = 1 if i < j. Find A 2 . 
Solution: ► Let A 2 = B = [bq]. Then 


n 

bq = ) a ik a kj . 
k=1 

Observe that the i-th row of A has i — 1 0 ’s followed by n — i + 1 1 ’s, and the j -th column of A 
has j 1 ’s followed by n j 0’s. Therefore ifi — 1 > ), then bq = 0. If i < j + 1 , then 


This means that 


i 

bq = Y_ a ika kj = j — i + 1 . 
k=i 


1 2 3 4 ••• n — 1 n 

0 1 2 3 ••• n — 2 n — 1 


A 2 


0 0 


2 


n — 3 n — 2 


◄ 


0 0 0 0 


2 


0 0 0 0 


0 1 


Homework 


Problem 2.2.1 Determine the product 


1 -1 


-2 1 


i 1 

1 i 


0 -1 


i 2 



1 

0 

0 


a 

b 

c 


Problem 2.2.2 Let A = 

1 

1 

0 

, B = 

c 

a 

b 

. Find AB and BA. 


1 

1 

1 


b 

c 

a 
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1 

2 

3 


1 

1 

1 


a 

a 

a 

Problem 2.2.3 Find a + b + c if 

2 

3 

1 


2 

2 

2 

= 

b 

b 

b 


3 

1 

2 


3 

3 

3 


c 

c 

c 


Problem 2.2.4 Let N 


0 -2 -3 -4 

0 0 -2-3 

0 0 0 -2 
0 0 0 0 


. Find N 3 


Problem 2.2.5 Let 











2 

3 

4 

T 


T 

T 

T 

T 

T 

2 

3 

4 

, B = 

T 

T 

T 

T 

4 

T 

2 

3 


T 

T 

T 

T 

3 

4 

T 

2 


T 

T 

T 

T 


be matrices in Mi x 4 ( Z5 ) . Find the products AB and BA. 


Problem 2.2.6 Let x be a real number, and put 


1 0 x 

-x 1 -- 

0 0 1 


If a, b are real numbers, prove that 

1. m(a)m(b) = m(a + b). 

2. m(a)m(— a) = I3, the 3x3 identity matrix. 


Problem 2.2.7 A square matrix X is called idempotent if X 2 = X. Prove that if AB = A and BA = B then A and B 
are idempotent. 


Problem 2.2.8 Let 


Calculate the value of the infinite series 



2 

0 

0 


0 

0 

2 - 


I 3 + A + A 2 + A 3 + • • • . 


2 


-4 x 

z. 

-1 

0 

-x 4 


0 

-1 


Problem 2.2.9 Solve the equation 
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over R. 

Problem 2.2.10 Prove or disprove! If (A, B) g (M nxn (F )) 2 are such that AB = 0 n , then also BA = 0 n . 

Problem 2.2.11 Prove or disprove! For all matrices (A,B) g (M nX n(F)) 2 , 

(A + B) (A — B) = A 2 -B 2 . 


Problem 2.2.12 Consider the matrix A = 


non-zero 2x2 matrices B such that AB = 


where x Is a real number. Find the value of x such that there are 


x 


0 

0 


Problem 2.2.13 Prove, using mathematical induction, that 


- 


n 

r -i 

1 

1 


l 

n 

0 

1 


0 

1 


Problem 2.2.14 Let M = 


1 

-1 


-1 

1 


Find M 6 . 


Problem 2.2.15 Let A = 


0 

2 


3 

0 


Find, with proof, A 2003 . 


Problem 2.2.16 Let (A, B,C) g Mi xm (F) X M mxn (F) x M,n Xn (F) and a g F. Prove that 

A(B + C) = AB + AC, 

(A + B)C = AC + BC, 
a(AB) = (otA)B = A(ctB). 


Problem 2.2.17 Let A g M 2 X 2 (R) be given by 


cos a 


A = 


sin a 

Demonstrate, using induction, that for n g N, n > 1 . 


— sin a 
cos a 


A n 


cosna — sinna 
sin rux cosna 


Problem 2.2.18 A matrix A = [cnj] G M n xn(R) is said to be checkered if tUj =0 when (j — i) is odd. Prove that the 
sum and the product of two checkered matrices is checkered. 
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Problem 2.2.19 Let A £ M 3 X 3 ( 


A = 


1 1 1 
0 1 1 
0 0 1 


Prove that 


A n = 


1 n 
0 1 
0 0 


n(n+1 ) 


n 


Problem 2.2.20 Let (A,B) £ (M n xn(F )) 2 and k be a positive integer such that A k = O n . If AB = B prove that 
B=O n . 


Problem 2.2.21 Let A > 


a b 
c d 


. Demonstrate that 


A 2 - (a + d)A + (ad — bc)I 2 = 0 2 


Problem 2.2.22 Let A £ M 2 (F) and let k £ Z, k > 2. Prove that A k = 0 2 if and only if A 2 = 0 2 . 

Problem 2.2.23 Find all matrices A £ M 2x2 (R) such that A 2 = 0 2 

Problem 2.2.24 Find all matrices A £ M 2x2 (R) such that A 2 = I 2 

Problem 2.2.25 Find a solution X £ M 2x2 (R) for 


X 2 — 2X 


-1 0 
6 3 


Problem 2.2.26 Find, with proof, a 4 x 4 non-zero matrix A such that 


■ 













" 

1 

1 

1 

1 


1 

1 

1 

1 


0 

0 

0 

0 

1 

1 

1 

1 


1 

1 

1 

1 

A = 

0 

0 

0 

0 

1 

1 

1 

1 


1 

1 

1 

1 


0 

0 

0 

0 

1 

1 

1 

1 


1 

1 

1 

1 


0 

0 

0 

0 


Problem 2.2.27 Let X be a 2 x 2 matrices with real number entries. Solve the equation 


1 1 


X 2 + X = 


1 1 
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Problem 2.2.28 Prove, by means of induction that for the following n x n we have 

3 

1 1 1 


0 1 1 
0 0 1 


0 0 0 ••• 1 000 


1 3 6 

0 1 3 

0 0 1 


n(n+l ) 


(n— 1 ) n 

2 

(n _2)(n— 1) 


Problem 2.2.29 Let 

1 -1 

A= _1 1 _1 

1 -1 1 

Conjecture a formula for A n and prove it using induction. 


2.3 Trace and Transpose 

85 Definition Let A = [dp] £ M nxn (F). Then the trace of A, denoted by tr (A) is the sum of the diagonal 
elements of A, that is 

n 

tr (A) = Y_ a Kk- 

k=1 


86 Theorem Let A = [a tj ] £ M nxn (F),B = [b tJ ] € M nxn (F). Then 

tr(A + B) = tr (A) + tr (B) , 
tr(AB) = tr (BA) . 


( 2 . 10 ) 

( 2 . 11 ) 


Proof: The first assertion is trivial. To prove the second, observe that AB = (Xo<=i u ikb k j ) and 

BA= (ILi b ik a kj ). Then 

n n n n 

tr (AB) = Y_ Y- aikbkl = Y- Y. bktCLlk = tr ( BA ) » 

i=1 k=l k=1 i=l 

whence the theorem follows. □ 

87 Example Let A £ M nxn (R). Shew that A can be written as the sum of two matrices whose trace is 
different from 0. 

Solution: ► Write 

A = (A — al n ) + al n . 

tr (A) 

Now, tr (A — al n ) = tr (A) — na and tr (al n ) = na. Thus it suffices to take a f , a f 0. 

n 

Since R has infinitely many elements, we can find such an a. 
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88 Example Let A, B be square matrices of the same size and over the same field of characteristic 0. Is 
it possible that AB — BA = I n ? Prove or disprove! 


Solution: ► This is impossible. For if, taking traces on both sides 

0 = tr (AB) - tr (BA) = tr (AB - BA) = tr (I n ) = n 
a contradiction, since n > 0. ◄ 

89 Definition The transpose of a matrix of a matrix A = [aq] £ M mxn (F) is the matrix A T = B = [bq] £ 
M n xm(F), where bq = a n . 


90 Example If 


with entries in K, then 


M = 


M 1 = 


a b c 
d e f 
g h i 

a d g 
b e h 
c f i 


91 Theorem Let 

A = [aq] € M mxn (F), B = [bq] £ M mxn (F), C = [cq] £ M nxr (F), a £ F,u £ N. 

Then 

A 71 " = A, 

( A + aB ) T = A T + aB T , 

(AC) t = c t a t , 

(A u ) t = (A t ) u . 


( 2 . 12 ) 

(2.13) 

(2.14) 

(2.15) 


Proof: The first two assertions are obvious, and the fourth follows from the third by using 

induction. To prove the third put A r = (aq), aq = aq, C T = (yq), yq = Cq, AC = (uq) and 
C t A t = (vq). Then 

n id n 

Uq = Y_ atkCicj = Y_ “kiYik = Y_ Tjkaici = Vq, 
k=1 k=1 k=1 

whence the theorem follows. □ 

92 Definition A square matrix A £ M nxn (F) is symmetric if A T = A. A matrix B £ M nxn (F) is skew- 
symmetric if B t = — B. 

93 Example Let A, B be square matrices of the same size, with A symmetric and B skew-symmetric. 
Prove that the matrix A 2 BA 2 is skew- symmetric. 

Solution: ► We have 

(A 2 BA 2 ) t = (A 2 ) t (B) t (A 2 ) t = A 2 (-B)A 2 = -a 2 ba 2 . 


◄ 
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94 Theorem Let F be a field of characteristic different from 2. Then any square matrix A can be written 
as the sum of a symmetric and a skew- symmetric matrix. 

Proof: Observe that 

(A + A t ) t = A t + A 1 ” 1 " = A t + A, 
and so A + A T is symmetric. Also, 

(A — A t ) t = A t - A 1-1 " = -(A- A t ), 
and so A — A T is skew-symmetric. We only need to write A as 

A = (2- 1 )(A + A t ) + (2- 1 )(A-A t ) 


to prove the assertion. □ 

95 Example Find, with proof, a square matrix A with entries in TLz such A is not the sum of a symmetric 
and and anti-symmetric matrix. 

Solution: ► In Z 2 every symmetric matrix is also anti-symmetric, since —x = x. Thus it is enough 


to take a non-symmetric matrix, for example, take A = 


0 1 
0 0 


◄ 


Homework 


Problem 2.3.1 Write 

1 2 3 

A = 2 3 1 G M 3x b( 

3 1 2 

as the sum of two 3x3 matrices Ei , E 2 , with tr (E 2 ) = 1 0. 

Problem 2.3.2 Give an example of two matrices A G 
M 2 x 2 (R) and B G M 2 x 2 (F) that simultaneously satisfy 
the following properties: 


1. Ay 

0 0 

and B f 

0 0 


0 0 


0 0 



0 

0 


0 

0 

2. AB = 

0 

0 

and BA = 

0 

0 


3. tr (A) = tr (B) =2. 

4. A = A T and B = B T . 

Problem 2.3.3 Shew that there are no matrices 


(A,B,C,D) G (Mnxn(R)) such that 
AC t DB = In, 


Problem 2.3.4 Let (A, B) G ( M 2 x 2 (R) ) 2 be symmetric 
matrices. Must their product AB be symmetric? Prove 
or disprove! 

Problem 2.3.5 Given square matrices (A,B) G 
(M 7 X 7 (R)) 2 such that tr (A 2 ) = tr (B 2 ) = 1, and 

(A-B) 2 = 3I 7 , 

find tr (BA). 


Problem 2.3.6 Consider the matrix A = 


a b 


c d 


M 2 x 2 (R) ■ Find necessary and sufficient conditions on 
a, b, c, d so that tr (A 2 ) = (tr (A) ) 2 . 


Problem 2.3.7 Given a square matrix A G M 4 X 4 ( 
such that tr (A 2 ) = —4, and 


(A-I4) 2 =314, 


find tr (A). 


Problem 2.3.8 Prove or disprove! If A, B are square 
matrices of the same size, then it is always true that 
tr ( AB) = tr (A) tr (B). 


CA + BD = 0 n . 


Problem 2.3.9 Prove or disprove! If (A,B,C) G 
(M 3X 3 (F)) 3 then tr (ABC) =tr(BAC). 
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Problem 2.3.10 Let A be a square matrix. Prove that 
the matrix AA T is symmetric. 

Problem 2.3. 1 1 Let A, B be square matrices of the same 
size, with A symmetric and B skew- symmetric. Prove 
that the matrix AB — BA is symmetric. 


tr(AA T )=L " 1 Lr-i a 0 ' 

Problem 2.3.13 Let X £ M n x n ( 

0 n then X = 0 n • 


) . Prove that if XX T = 


Problem 2.3.14 Let m, n, p be positive integers and A £ 
Mmxn(R), B £ Mnxp(R), C £ M pxm (R). Prove that 
(BA) t A = (CA) t A BA = CA. 


Problem 2.3.12 Let A £ M n xn(F), A = [dp]. Prove that 

2.4 Special Matrices 

96 Definition The main diagonal of a square matrix A = [aq] 6 M nxn (F) is the set {a« : i < n}. The 

counter diagonal of a square matrix A = [aq] 6 M nxn (F) is the set {a l n l+1 : i < n}. 

97 Example The main diagonal of the matrix 

0 1 5 

A= 3 2 4 
9 8 7 

is the set {0, 2, 7}. The counter diagonal of A is the set {5,2, 9}. 

98 Definition A square matrix is a diagonal matrix if every entry off its main diagonal is Op. 

99 Example The matrix 


A = 


1 0 0 
0 2 0 
0 0 3 


is a diagonal matrix. 

100 Definition A square matrix is a scalar matrix if it is of the form al n for some scalar a. 

101 Example The matrix 


A = 


4 0 0 
0 4 0 
0 0 4 


= 4I 3 


is a scalar matrix. 


102 Definition A £ M mxn (F) is said to be upper triangular if 

(V(i , )) 6 (1,2,- • • , n} 2 ) , (i > j, aq = 0 F ), 

that is, every element below the main diagonal is Op. Similarly, A is lower triangular if 

(V(i, j) € 0,2,- •• , n} 2 ) , (i < j, aq = 0 F ), 
that is, every element above the main diagonal is 0 F . 
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103 Example The matrix A £ M 3 X 4 (R) shewn is upper triangular and B £ M 4 X 4 (R) is lower triangular. 


A = 


1 a b c 
0 2 3 0 
0 0 0 1 


B = 


10 0 0 
1 a 0 0 

0 2 3 0 

1 1 t 1 


104 Definition The Kronecker delta 6 y is defined by 


| 1 F if i j 
[ 0 F ifif'j 


105 Definition The set of matrices Eij £ M mxn (F),Eij = (e rs ) such that ey = Ip and eyy = Op, f 

(i, j) is called the set of elementary matrices. Observe that in fact e rs = Sir&sj- 

Elementary matrices have interesting effects when we pre-multiply and post-multiply a matrix by them. 


106 Example Let 


Then 






1 


0 

0 

0 



1 

2 

3 

4 












0 

0 

1 


A = 

5 

6 

7 

8 

, E 23 = 











0 

0 

0 



9 

10 

11 

12 












0 

0 

0 



0 

0 

0 

0 


0 

0 

2 

E 23 A = 

9 

10 

11 

12 

, ae 23 = 


0 

0 

6 


0 

0 

0 

0 


0 

0 

10 


107 Theorem (Multiplication by Elementary Matrices) Let Ey £ M mxn (F) be an elementary matrix, and 
let A £ M nxm (F). Then EyA has as its i-th row the j-fh row of A and 0 F ’s everywhere else. Similarly, 
AEy has as its j-th column the i-th column of A and Op’s everywhere else. 

Proof: Put (cxuv) = Eij A. To obtain Ey A we multiply the rows of Ey by the columns of A. Now 

n n 

0C U v = } CukU-kv = } ^ui^kjC-kv 6uiUjv* 
k =1 k =1 

Therefore, for u f i, a uv = Op, i.e., off of the i-th row the entries ofEyA are Op, and at v = aj V , 
that is, the i-th row o/'Ey A is the j -th row of A. The case for AEy is similarly argued.d 


The following corollary is immediate. 
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108 Corollary Let (EipEia) E (M nxn (F)) 2 , be square elementary matrices. Then 

EijE]<i = 6ji<Eii. 

109 Example Let M E M nxn (F) be a matrix such that AM = MA for all matrices A E M nxn (F). 
Demonstrate that M = al n for some a € F, i.e. Mis a scalar matrix. 

Solution: ► Assume (s,t) E {1 , 2, . . . , n} 2 . Let M = (mtj) and E st E M nxn (F). Since M 
commutes with E st we have 


0 

0 . 

. 0 


0 

0 . 

mis 

.. 0 





0 

0 

m 2s 

: 0 

m t1 

m t2 . 

• TU tn 

= E st M = ME st = 









0 

0 

1 )s 

: 0 

0 

0 . 

0 


0 

0 

TTlns 

: 0 


For arbitrary s yt we have shown that m st = m ts = 0, and that m ss = m tt . Thus the entries off 
the main diagonal are zero and the diagonal entries are all equal to one another, whence M is a 
scalar matrix. < 

1 10 Definition Let A E F and E t j E M nxn (F). A square matrix in M nxn (F) of the form I n + AE t j is called 
a transvection. 


1 1 1 Example The matrix 


1 0 4 


T = I3 + 4Ei 3 


0 1 0 


0 0 1 


is a transvection. Observe that if 


1 1 1 


A = 


5 6 7 


1 2 3 


then 


TA = 


1 0 4 


1 1 1 


5 9 13 

0 1 0 


5 6 7 

= 

5 6 7 

1 

O 

O 

l 


1 2 3 


1 2 3 


that is, pre-multiplication by T adds 4 times the third row of A to the first row of A. Similarly, 


1 1 1 


1 0 4 


1 1 5 

5 6 7 


0 1 0 

= 

5 6 27 

1 2 3 


0 0 1 


1 2 7 


AT = 
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that is, post-multiplication by T adds 4 times the first column of A to the third row of A. 


In general, we have the following theorem. 

112 Theorem (Multiplication by a Transvection Matrix) Let I n + AE tj e M nxn (F) be a transvection and let 
A E M nxm (F). Then (I n + AE,, )A adds the j-th row of A to its i-th row and leaves the other rows 
unchanged. Similarly, if B 6 M pxn (F), B(T n + AEp) adds the i-th column of B to the j-th column and 
leaves the other columns unchanged. 

Proof: Simply observe that (I n + AEij ) A = A + AE t j A and A(I n + AEp ) = A + AAE t j and apply 

Theorem 107. □ 

Observe that the particular transvection I n + (A — 1 F )Eu E M nxn (F) consists of a diagonal matrix with 
Vs everywhere on the diagonal, except on the ii-th position, where it has a A. 

1 13 Definition If A Op, we call the matrix I n + (A — T F )Eu a dilatation matrix. 


1 14 Example The matrix 


4 0 0 


S = I 3 + (4 — 1)En 


0 


0 


0 0 1 


is a dilatation matrix. Observe that if 


1 1 1 


A = 


6 7 


then 


1 2 3 


4 0 0 


1 1 1 


4 4 4 

0 1 0 


5 6 7 

= 

5 6 7 

0 0 1 


1 2 3 


1 2 3 


that is, pre-multiplication by S multiplies by 4 the first row of A. Sfmilarly, 


AS = 


1 1 1 


4 0 0 


4 1 1 

5 6 7 


0 1 0 

= 

20 6 7 

1 2 3 


0 0 1 


4 2 3 


that is, post-multiplication by S multiplies by 4 the first column of A. 

1 15 Theorem (Multiplication by a Dilatation Matrix) Pre-multiplication of 

the matrixA E M nxm (F) by the dilatation matrix I n + (A — 1 F )Eu E M nxn (F) multiplies the i-th row of 
A by A and leaves the other rows of A unchanged. Similarly, if B E M pxn (F) post-multiplication of B by 
In + (A — lpjE u multiplies the i-th column of B by A and leaves the other columns of B unchanged. 
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Proof: This follows by direct application of Theorem 112. □ 

1 16 Definition We write IJ’ for the matrix which permutes the i-th row with the j-th row of the identity 
matrix. We call 1^ a transposition matrix. 

1 1 7 Example We have 


If 


10 0 0 
0 0 10 
0 10 0 
0 0 0 1 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 


then 


and 


1 

2 

3 

4 

9 

10 

11 

12 

5 

6 

7 

8 

13 

14 

15 

16 

1 

3 

2 

4 

5 

7 

6 

8 

9 

11 

10 

12 

13 

15 

14 

16 


118 Theorem (Multiplication by a Transposition Matrix) If A e M nxm (F), then I^A is the matrix obtained 
from A permuting the the i-th row with the j-th row of A. Similarly, if B 6 M pxn (F), then BIJ’ is the 
matrix obtained from B by permuting the i-th column with the j-th column of B. 

Proof: We must prove that I^A exchanges the i-th and j-th rows but leaves the other rows 

unchanged. But this follows upon observing that 

= In + Eij + Ejt — Eu — Ejj 


and appealing to Theorem 107. 

□ 
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1 19 Definition A square matrix which is either a transvection matrix, a dilatation matrix or a transposi- 
tion matrix is called an elimination matrix. 

In a very loose way, we may associate pre-multiplication of a matrix A by another matrix 
with an operation on the rows of A, and post-multiplication of a matrix A by another with an 
operation on the columns of A. 


Homework 


Problem 2.4.1 Consider the matrices 


A = 


1 0 10 
0 10 1 
-1111 
1-111 


4-242 
0 10 1 
11-11 
1-111 


Find a specific dilatation matrix D, a specific transposi- 
tion matrix P, and a specific transvection matrix T such 
that B = TDAP. 


Problem 2.4.2 The matrix 


A = 


a b c 
d e f 
g h i 


is transformed into the matrix 

h- g g i 
e d d f 
2b — 2a 2a 2c 


by a series of row and column operations. Find explicit 
permutation matrices P, P ' , an explicit dilatation matrix 
D , and an explicit transvection matrix T such that 

B = DPAP'T. 


Problem 2.4.3 Let A £ M n xn(F). Prove that if 
(VX £ Mn X n(F)), (tr(AX) =tr(BX)), 
then A = B . 


Problem 2.4.4 Let A £ Mr, 

(VX £ Mnxn 
Prove that A = O n . 


be such that 


, ((XA ) 2 =O n 


2.5 Matrix Inversion 

120 Definition Let A E M mxn (F). Then A is said to be left-invertible if 3L E M nxm (F) such that LA = I n . 
A is said to be right-invertible if 3R E M nxm (F) such that AR = I m . A matrix is said to be invertible if it 
possesses a right and a left inverse. A matrix which is not invertible is said to be singular. 


121 Example The matrix A E M 2 X 3(R) 



0 0 


0 1 0 


has infinitely many right-inverses of the form 


1 0 


R 


(*,M) 


0 1 


* y 
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r 



1 

0 



1 

1 0 

0 


0 

1 

— 

1 

0 

0 1 

0 





0 

1 




X 

y 





regardless of the values of x and y . Observe, however, that A does not have a left inverse, for 


a 

b 






a 

b 

0 




1 

0 

0 





c 

d 


0 

1 

0 


c 

d 

0 

f 

9 






f 

9 

0 


which will never give I3 regardless of the values ofa,b,c,d,f,g. 

122 Example If A =/ 0 , then the scalar matrix AI n is invertible, for 

(Ain) (A - 1 In) = In = (A^In) (AI n ) . 


123 Example The zero matrix O n is singular. 

124 Theorem Let A E M nX n(F) a square matrix possessing a left inverse L and a right inverse R. Then 
L = R. Thus an invertible square matrix possesses a unique inverse. 

Proof: Observe that we have LA = I n = AR. Then 

L = LI n = L(AR) = (LA)R = I n R = R. 


□ 

125 Definition The subset of M nX n(F) of all invertible n x n matrices is denoted by GL n (F), read “the 
linear group of rank n over F.” 

126 Corollary Let (A, B) E (GL n (F)) 2 . Then AB is also invertible and 

(AB) -1 = B 1 A 1 . 


Proof: Since AB is a square matrix, it suffices to notice that 

B 1 A 1 (AB) = (AB)B 1 A 1 = I n 

and that since the inverse of a square matrix is unique, we must have B 1 A 1 = (AB) -1 . □ 

127 Corollary If a square matrix S E M nX n(F) is invertible, then S 1 is also invertible and ( S 1 ) 1 = S, 
in view of the uniqueness of the inverses of square matrices. 

128 Corollary If a square matrix A E M nX n(F) is invertible, then A T is also invertible and (A 1 ) 1 = 
(A -1 ) T . 

Proof: We claim that (A 1 ) 1 = (A -1 ) T . For 

AA -1 =In =* (AA-V = i; =»■ (A - 1 ) T A T =I n , 


where we have used Theorem 91 . □ 
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The next few theorems will prove that elimination matrices are invertible matrices. 

129 Theorem (Invertibility of Transvections) Let I n + AE U e M nxn (F) be a transvection, and let i =/ ). Then 

(In + AEij )- 1 = I n — AEij . 

Proof: Expanding the product 

(I n + AEtj ) (I n AEtj ) = In T AEtj AEtj A^EtjEtj 

= In A 2 6tjEtj 

= In, 

since i =/ ) . □ 

130 Example By Theorem 129, we have 


1 0 3 


1 0 -3 


1 0 0 

0 1 0 


0 1 0 

= 

0 1 0 

0 0 1 


0 0 1 


0 0 1 


131 Theorem (Invertibility of Dilatations) Let A =/0 F . Then 

(In + (A - l^Eu)- 1 = In + (A- 1 - 1 f )E u . 

Proof: Expanding the product 

(In + (A-1 F )E it )(In+ (A- 1 -1f)E U ) = In T (A T F )E it 

+ 1A- 1 -T f )Eu 
+ (A-1 f )(A- 1 -1 f )Eu 
= In + (A — 1 F )Eii 
+ 1A- 1 -T f )Eu 
+ (A-1 f )(A- 1 -1 f ))Eu 
= In+(A — 1 F + A 1 — 1 F + 1 F 
-A -A- 1 -l F ))E ti 


proving the assertion. □ 
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132 Example By Theorem 131, we have 


1 0 0 


1 0 0 


1 0 0 

0 2 0 


0 \ 0 

= 

0 1 0 

1 

O 

O 

l 


0 0 1 


0 0 1 


Repeated applications of Theorem 131 gives the following corollary. 
133 Corollary If Ai A 2 A 3 • • • A n y Op, then 

Ai 0 0 0 • •• 0 

0 A 2 0 0 • • • 0 

0 0 A 3 0 • • • 0 

0 0 0 0 • • • A n 

is invertible and 


Ai 

0 

0 

0 • • 

• 0 

-1 

V 1 

0 

0 

0 •• 

0 

0 

a 2 

0 

0 • • 

• 0 


0 

a 2 1 

0 

0 •• 

0 

0 

0 

A 3 

0 •• 

• 0 

= 

0 

0 


0 •• 

0 

0 

0 

0 

0 •• 

• An 


0 

0 

0 

0 •• 

• V 


134 Theorem (Invertibility of Permutation Matrices) Let t 6 S n be a permutation. Then 

dn’r 1 = dis- 

proof: By Theorem 118 pre-multiplication o/'I^ by id exchanges the i-th row with the j-th row, 

meaning that they return to the original position in I n . Observe in particular that id = (Id ) T , and 
SO Id (I^) T = In. □ 

135 Example By Theorem 134, we have 


1 0 0 


1 0 0 


1 0 0 

0 0 1 


0 0 1 

= 

0 1 0 

0 1 0 


0 1 0 


0 0 1 


136 Corollary If a square matrix can be represented as the product of elimination matrices of the same 
size, then it is invertible. 
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Proof: This follows from Corollary 126, and Theorems 129, 131, and 134. □ 


137 Example Observe that 


1 0 0 


A = 


0 3 4 


0 0 1 


is the transvection I3 + 4E23 followed by the dilatation of the second column of this transvection by 3 . 
Thus 


1 0 0 


1 

O 

O 

1 


1 0 0 

0 3 4 

= 

0 1 4 


0 3 0 

0 0 1 


0 0 1 


0 0 1 


and so 


r 



-1 




- 

1 



1 

0 

0 


1 

0 

0 


1 

0 0 

0 

3 

4 

= 

0 

3 

0 


0 

1 4 

0 

0 

1 


0 

0 

1 


0 

0 1 





1 

0 

0 


1 

0 

0 




= 

0 

1 

3 

0 


0 

1 

-4 





0 

0 

1 


0 

0 

1 


1 0 0 


0 0 1 


1 1 1 


1 1 0 


1 0 0 

0 1 1 

= 

0 1 0 


0 1 1 

0 0 1 


0 0 1 


0 0 1 


138 Example We have 
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hence 


r 



-1 





1 

r 



1 

1 

1 


1 

0 

0 



1 

1 

0 

0 

1 

1 

= 

0 

1 

1 



0 

1 

0 

0 

0 

1 


0 

0 

1 



0 

0 

1 





1 

0 

0 


1 

-1 

0 




= 

0 

1 

— 

1 


0 

1 

0 





0 

0 

1 



0 

0 

1 





1 

-1 


0 







= 

0 

1 


— 

1 








0 

0 


1 








In the next section we will give a general method that will permit us to find the inverse of a square 
matrix when it exists. 


139 Example Let T = 


a b 
c d 


E M2x 2(K)- Then 


a b 


d -b 


1 0 




= (ad — be) 


c d 


— c a 


0 1 


Thus if ad — b c^O we see that 


T - 1 


d b 

ad— be ad— be 

c a 

ad— be ad— be 


140 Example If 


1111 
1 1 - 1-1 
1-1 1 -1 


1 -1 -1 


1 
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then A is invertible, for an easy computation shews that 

1111 
1 1 - 1-1 
1-1 1 -1 
1 -1 -1 1 

whence the inverse sought is 


A 2 = 


= 414 


, 1 1 
A- 1 = -A = - 
4 4 


1 

1 

1 

1 


1/4 

1/4 

1/4 

1/4 

1 

1 

-1 -1 


1/4 

1/4 

- 1/4 

- 1/4 

1 -1 

1 

-1 


1/4 

- 1/4 

1/4 

- 1/4 

1 -1 

-1 

1 


_ l /4 

- 1/4 

- 1/4 

1/4 _ 


141 Example AmatrixA 6 M nxn (R) is said to be nilpotent of index k if satisfies A =f 0 n , A 2 f 0 n , . . . ,A k 1 f 
0 n and A k = 0 n for integer k > 1 . Prove that if A is nilpotent, then I n — A is invertible and find its inverse. 

Solution: ► To motivate the solution, think that instead of a matrix, we had a real number x 
with |x| < 1 . Then the inverse of 1 x is 

(1 — x)~ T = =1 + x + x 2 + x 3 H . 

1 —X 

Notice now that since A k = 0 n , then A p = 0 n for p > k. We conjecture thus that 

(I n - A)- 1 = In + A + A 2 + • • • + A k - 7 . 

The conjecture is easily verified, as 


(I n -A)(I n +A + A 2 + ...+A k - 1 ) = In + A + A 2 + ---+A k - 1 

— (A + A 2 + A 3 4 1- A k ) 


= I 


n 


and 


(I n +A + A 2 + ---+A k - 1 )(In-A) = I n — A + A — A 2 + A 3 — A 4 + • • • 

b A k - 2 - A k - 1 + A k_1 - A k 


= I 


n • 


◄ 
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142 Example The inverse of A 6 M3 X 3(Zs), 


is 


as 


A = 


2 

0 

0 


0 

3 

0 


0 

0 

4 


A - 1 


3 0 0 
0 2 0. 
0 0 4 


2 0 0 


3 0 0 


1 0 0 

0 3 0 


0 2 0 

= 

0 1 0 

0 0 4 


0 0 4 


0 0 1 


143 Example (Putnam Exam, 1991) Let A and B be different n x n matrices with real entries. If A 3 = B 3 
and A 2 B = B 2 A, prove that A 2 + B 2 is not invertible. 


Solution: ► Observe that 

(A 2 + B 2 )(A - B) = A 3 - A 2 B + B 2 A - B 3 = O u 
If A 2 + B 2 were invertible, then we would have 

A — B = (A 2 + B 2 )- 1 (A 2 + B 2 )(A-B) = O n , 
contradicting the fact that A and B are different matrices. ◄ 

144 Lemma If A g M nxn (F) has a row or a column consisting all of Op’s, then A is singular. 


Proof: If A were invertible, the (i, i)-th entry of the product of its inverse with A would be Ip. 
But if the i-th row of A is all Op ’s, then X.k=i a-ikbkt = Op, so the (i, i) entry of any matrix product 
with A is Op, and never Ip. □ 


1 1 1 


Problem 2.5.1 The inverse of the matrix A = -\ 


2 


Problem 2.5.2 A square matrix A satisfies A 3 f O n but 
A 4 = O n . Demonstrate that I n + A is invertible and find, 
with proof, its inverse. 


is the matrix A 1 


a 1 -1 


2 3 


Problem 2.5.3 Prove or disprove! If (A,B, A + B) g 
(GL„(R )) 3 then (A + B ) -1 = A -1 + B 1 . 


1 b 


1 . Determine a and b. 


-1 1 0 


Problem 2.5.4 Let S g GL n (F), (A,B) g (M n x-n(F)) 2 , 
and k a positive integer. Prove that if B = SAS 1 then 
B k = SA k S~ T . 


Problem 2.5.5 Let A g M nxn (F) and let k be a positive 


44 


Chapter 2 


Integer. Prove that A is invertible if and only if A k is 
invertible. 


a b b ••• b 
b a b ••• b 


Problem 2.5.6 Let S G GLn(C), A G M n xn(C) with 
A k = O n for some positive integer k. Prove that both 
I n — SAS ' and I n — S 1 AS are invertible and find their 
inverses. 


Problem 2.5.10 Let A 


b b a 


b 


G 


Problem 2.5.7 Let A and B be square matrices of the 
same size such that both A — B and A + B are invertible. 
Put C = (A - B) -1 + (A + B) 1 . Prove that 


ACA - ACB + BCA - BCB = 2A. 


b b b ••• a 

Mnxn(F), n > 1, (a, b) G F 2 . Determine when A is in- 
vertible and find this inverse when it exists. 

Problem 2.5.11 Let (A,B) G (M n xn(F )) 2 be matrices 
such that A + B = AB. Demonstrate that A — I n is in- 
vertible and find this inverse. 


Problem 2.5.8 Let A, B,C be non-zero square matri- 
ces of the same size over the same field and such that 
AB C = 0 n . Prove that at least two of these three matri- 
ces are not invertible. 


Problem 2.5.12 Let S G GL n (F) and A G M nxn (F). 
Prove that tr (A) = tr (SAS -1 ) . 

Problem 2.5.13 Let A G M nX n(R) be a skew-symmetric 
matrix. Prove that I n + A is invertible. Furthermore, if 
B = (In - A) (In + A) -1 , prove that B 1 = B T . 


Problem 2.5.9 Let (A,B) G fM nX n(F) ) 2 be such that 
A 2 = B 2 = (AB) 2 = I n . Prove that AB = BA. 


Problem 2.5.14 A matrix A G M n xn(F) is said to be a 
magic square if the sum of each individual row equals 
the sum of each individual column. Assume that A is 
a magic square and invertible. Prove that A -1 is also a 
magic square. 


2.6 Block Matrices 

145 Definition Let A G M mX n(F). B G M mxs (R)> C G M rxn (F), D G M rxs (F). We use the notation 


r 

1 

A 

B 

C 

D 


for the block matrix L G M (m+r ) X(Tl+S )(F). 

^ If( A, A 1 ) G (M m (F)) 2 , (B, B') G (M mxn (F)) 2 , (C,C') G (M nxm (F)) 2 , (D, D') G (M m (F)) 2 , 
and 


r 





A 

B 

, T = 

A' 

B' 





c 

D 


C' 

D' 


then it is easy to verify that 


AA' + BC' 

AB' + BD' 

CA' + DC' 

CB' + DD' 


ST = 
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146 Lemma Let L E M (Tn+r ) x (m+r) (F) be the square block matrix 



1 

A 

c 

Orxm 

B 


with square matrices A E M m (F) and B E M rxr (F), and a matrix C E M mxr (F). Then L is invertible if 
and only if A and B are, in which case 






A- 1 

—A 1 CB 1 


Orxm 

B 1 


Proof: Assume first that A, and B are invertible. Direct calculation yields 


A 

c 


A- 1 

i 

pa 

U 

7 

< 

1 


AA- 1 

AA 1 CB 1 + CB 1 

Orxm 

B 


®rxm 

B 1 


Qr xm 

BB 1 


r 


Im 

Omxr 

Orxm 

Ir 




— lm+r- 


Assume now that L is invertible, L 1 
say, B is singular. Then 


H 


, with E E M m (F) and K E M rxr (F), but that, 


J K 




Im 

Omxr 

®rxm 

Ir 




EL 1 


r 




1 

A 

c 


E 

H 

Orxm 

B 


J 

K 


AE + CJ 

AH + BK 

BJ 

BK 


which gives BK = I r , i.e., B is invertible, a contradiction. □ 


2.7 Rank of a Matrix 

147 Definition Let (A,B) E (M mxn (F)) 2 . We say that A is row-equivalent to B if there exists a matrix 
R E GL m (F) such that B = RA. Similarly, we say that A is column-equivalent to B if there exists a matrix 
C E GL m (F) such that B = AC. We say that A and B are equivalent if 3(P, Q) E GL m (F) x GL n ( F ) such 
that B = PAQ. 
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148 Theorem Row equivalence, column equivalence, and equivalence are equivalence relations. 

Proof: We prove the result for row equivalence. The result for column equivalence, and 

equivalence are analogously proved. 


Since I m G GL m (F) and A = I m A, row equivalence is a reflexive relation. Assume (A,B) G 
(M mxn (F)) 2 and that 3P G GL m (F) such that B = PA. Then A = P _1 B and since P 1 G 
GL m (F), we see that row equivalence is a symmetric relation. Finally assume (A,B,C) G 
(M mxn (F)) 3 andthat 3P G GL m (F), 3P' G GL m (F) suchthat A = PB,B = P'C. Then A = PP'C. 

But PP' G GL m (F) in view of Corollary 126. This completes the proof. □ 

149 Theorem Let A G M mxn (F). Then A can be reduced, by means of pre-multiplication and post- 
multiplication by elimination matrices, to a unique matrix of the form 


Ir 

®rx (n— r) 

®(m — r) xr 

0(m — r) X (n — r) 


(2.16) 


called the Hermite normal form of A. Thus there exist P G GL m (F),Q G GL n (F) such that D lnnr = PAQ. 
The integer r > 0 is called the rank of the matrix A which we denote by rank ( A) . 


Proof: If A is the m x n zero matrix, then the theorem is obvious, taking r = 0. Assume hence 

that A is not the zero matrix. We proceed as follows using the GauJ3 -Jordan Algorithm. 

GJ-1 Since A is a non-zero matrix, it has a non-zero column. By means of permutation matrices 
we move this column to the first column. 

GJ-2 Since this column is a non-zero column, it must have an entry a =/ CL. Again, by means of 
permutation matrices, we move the row on which this entry is to the first row. 

GJ-3 By means of a dilatation matrix with scale factor a 1 , we make this new (1,1) entry into a 

If- 

GJ-4 By means of transvections (adding various multiples of row 1 to the other rows) we now 
annihilate every entry below the entry (1,1). 

This process ends up in a matrix of the form 


Pi AQi 


If 

* 

* 

• * 

Of 

b 22 

b23 

• b 2n 

Of 

b32 

633 

* fa3n 

Of 


: 

: 

Of 

fa m 2 

^ m3 * 

’ fainn 


(2.17) 


Here the asterisks represent unknown entries. Observe that the b’sform a(m— 1) x (n — 1 ) 
matrix. 


GJ-5 Apply GJ-1 through GJ-4 to the matrix of the b ’s. 
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Observe that this results in a matrix of the form 


PzAQ 2 = 


Ip 

* 

* 

. * 

Op 

Ip 

* 

* 

Op 

Op 

C33 

* C3 n 

Op 


: 

: 

Op 

Op 

Cm 3 * 

■ c mTV 


( 2 . 18 ) 


GJ-6 Add the appropriate multiple of column 1 to column 2, that is, apply a transvection, in order 
to make the entry in the (1,2) position Op. 

This now gives a matrix of the form 


P3AQ3 = 


Ip 

Op 

* 

• * 

Op 

Ip 

* 

* 

Op 

Op 

C33 

* C3 n 

Op 


: 

: 

Op 

Op 

Cm 3 * 

* c mTV 


( 2 . 19 ) 


The matrix of the c’s has size (m — 2) x (n — 2). 

GJ-7 Apply GJ-1 through GJ-6 to the matrix of the c’s, etc. 

Observe that this process eventually stops, and in fact, it is clear that rank (A) < min(m, n) . 


Suppose now that A were equivalent to a matrix D m n s with s > r. Since matrix equivalence 
is an equivalence relation, D m n s and D m n r would be equivalent, and so there would be 
R <E GL m (F), S <E GLn(F), such that RD m , n>r S = D m , n>s , that is, RD m , n r = D m>n>s S _1 . 
Partition R and S 1 as follows 



Rn 

Rl 2 


Si 1 

Sl 2 

Si 3 

R = 

0 — 1 

S21 

S 3 ! 

S22 

S32 

S 23 

S33 

R21 

R 22 

) ^ 


with (Rn, Sn ) 2 € (M rxr 


(F)) 2 ,S 2 2 € M( S _ r)x(s _ r) (F). We have 


P 







-| 

Rn 

Rl 2 


Ir 

Q(m— r) xr 


Rn 

®(m — r) xr 

R21 

R22 


Q(m— r) xr 

O r x (m — r) 


R21 

®rx (m — r) 
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and 









Ir 

Orx(s-r) Orx(n— s) 


Si i 

Sl 2 Si 3 

D m ,n,sS ^ — 

0( s — r) xr 

Is— r 0 (s— r)x(n— s) 


S 21 

S 22 S 23 


0(m— s) Xr 

0(m — s)x(s — r) 0(m — s)x(n— s) 


S3! 

S 32 S 33 


r 



Si i 

Sl2 

Si 3 

S21 

S 22 

S23 

0(m— s) xr 

0(m — s) X (s — r) 

0(m— s) X (n— s) 


Since we are assuming 


r 


R 11 

0(m — r)xr 

R 21 

Orx (m— r) 


we must have Si 2 = O rX (s— r)> Si 3 




-1 

Si 1 

Sl 2 

Si 3 

S 21 

S 22 

S23 

0(m — s) Xr 

0(m— s) X (s — r) 

0(m— s) X (n— s) 


— ®rx(n-s)i S 22 — 0(s— r) x (s— r) > S 23 — 0(s— r)x(n- 


s) 


. Hence 







Si 1 

®rx(s-r) 

Orx (n— s) 

S- 1 = 

S21 

0( S -r}x(s-r) 

0(s — r) X (n— s) 


S3! 

S 32 

S 33 


The matrix 


0(s— r) X (s— r) 

®(s — r} X (n— s) 

S 32 

S 33 


is non-invertible, by virtue of Lemma 144. This entails that S 1 is non-invertible by virtue of 
Lemma 146. This is a contradiction, since S is assumed invertible, and hence S 1 must also be 
invertible. □ 


Albeit the rank of a matrix is unique, the matrices P and Q appearing in Theorem 1 49 are 
not necessarily unique. For example, the matrix 


1 0 
0 1 
0 0 

has rank 2, the matrix 

1 0 x 

0 1 y 
0 0 1 
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is invertible, and an easy computation shews that 


1 

0 

X 


i 

0 





1 

0 








1 

0 




0 

1 

y 


0 

1 


0 

1 


0 

1 

0 

0 

i 


0 

0 





0 

0 


regardless of the values of x and y . 

150 Corollary Let A E M mxn (F). Then rank (A) = rank (A T ). 


Proof: Let P, Q, D m Ti r as in Theorem 149. Observe that P T , Q T are invertible. Then 

PAQ = D m n r =4 Q t A t P t = D^ n r = D n , m , r , 
and since this last matrix has the same number of 1 F ’s as D mnr , the corollary is proven. □ 


151 Example Shew that 


0 2 


A = 


3 


0 1 0 


has rank (A) = 2 and find invertible matrices P E GL 2 (R) and Q E GL ^ ( 1. ) such that 


PAQ 


1 0 0 
0 1 0 


Solution: ► We first transpose the first and third columns by effecting 


0 2 3 
0 1 0 


0 0 1 
0 1 0 
1 0 0 


3 2 0 
0 1 0 


We now subtract twice the second row from the first, by effecting 


1 

1 

K> 

i 


3 2 0 


3 0 0 

0 1 


0 1 0 


1 

O 

o 

1 


Finally, we divide the first row by 3, 


1/3 0 


i 

1 

o 

o 


i 

o 

o 

1 

1 

O 

i 


1 

o 

1 0 


1 

o 

o 

1 


We conclude that 


1/3 0 


i 

—2 


0 2 3 

0 1 


0 

1 


0 1 0 


0 0 1 
0 1 0 
1 0 0 


1 0 0 
0 1 0 
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from where we may take 


and 


1/3 0 


1 

-2 


1/3 -2/3 

0 1 


0 

1 


0 1 


0 0 1 


Q 


o 


0 • 


1 0 0 


◄ 


In practice it is easier to do away with the multiplication by elimination matrices and perform row 
and column operations on the augmented (m + n) x (m + n) matrix 



" 

In 

Onxm 

A 

Im 

*- 



152 Definition Denote the rows of a matrix A E M mxn (F) by Ri , R 2 , . . . , R m , and its columns by Ci , C 2 , . . . , C n 
The elimination operations will be denoted as follows. 


• Exchanging the i-th row with the j-th row, which we denote by R ; h Rj, and the s-th column by 
the t-th column by C s <-> C t . 


• A dilatation of the i-th row by a non-zero scalar a E F \ {Op}, we will denote by aR t — » Ri. Similarly, 
(3Cj — > Cj denotes the dilatation of the j-th column by the non-zero scalar |3. 


• A transvection on the rows will be denoted by Ri + aRj — > Ri , and one on the columns by C s + pC t — > 
C s . 


153 Example Find the Hermite normal form of 

-1 0 
0 0 

A = 

1 1 
1 2 


Solution: ► First observe that rank (A) < min(4, 2) = 2, so the rank can be either 1 or 2 (why 
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not 0 ?). Form the augmented matrix 


1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

-1 

0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

1 

1 

0 

0 

1 

0 

1 

2 

0 

0 

0 

1 


Perform R 5 + R3 — » R5 and R 6 + R ; — > R 6 successively, obtaining 


1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 


0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

1 

0 

1 

0 

0 

2 

1 

0 

0 

1 


Perform R 6 — 2R 5 — > R 6 


1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

-1 

0 

1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

1 

0 

1 

0 

0 

0 

-1 

0 

-2 

1 


1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

-1 

0 

1 

0 

0 

0 

0 

1 

1 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

-1 

0 

-2 

1 


Perform R 4 R5 
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Finally, perform — R 3 — > R 3 


1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

1 

0 

-1 

0 

0 

0 

0 

1 

1 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

-1 

0 

-2 

1 


We conclude that 


-10 0 0 


-1 0 



1 0 

10 10 


0 0 


1 0 


0 1 

0 10 0 


1 1 


0 1 


0 0 

- 10-21 


1 2 



0 0 


◄ 

154 Theorem Let A £ M mxn (F), B £ M nxp (F). Then 

rank (AB) < min (rank (A) , rank (B)). 


Proof: We prove that rank (A) > rank (AB). The proof that rank (B) > rank (AB) is similar 

and left to the reader. Put r = rank (A) , s = rank (AB). There exist matrices P £ GL m (F), 
Q £ GL n (F), S £ GL m (F), T £ GL P (F) such that 

PAQ = D mnr , SABT = D mps . 


Now 


D m , p , s = SABT = SP 1 D m , nr Q 1 BT, 

from where it follows that 


PS = D mnr Q 1 BT. 

Now the proof is analogous to the uniqueness proof of Theorem 149. Put U = PS -1 £ GL n 
and V = Q 1 BT £ M nxp (F), and partition U and V as follows: 


U 


U11 

U 12 

U 21 

U 22 


v = 


Vn 

Vi 2 

v 21 

v 22 


with Ui 1 £ M s (F),Vn £ M rxr (F). Then 


UD 


TTT , p , S 


U 17 

U 12 

U 21 

U 22 


Is 

Os X (p — s) 

0(m — s) xs 

0(m— s) X (p — s) 


£ M mxp (F), 
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and 


Ir 

0 r x (n— r) 


V„ 

Vi 2 

Q(m— r) X r 

0(m — r) X (n — r) 


V 21 

V 22 


From the equality of these two m x p matrices, it follows that 


e M mxp (F). 


U11 

Osx (p— s} 


Vn 

v 12 

U21 

0(m— s) X (p— s) 


Q(m— r) xr 

0(m— r) X (n— r) 


If s > r then (i) Ui i would have at least one row of Of ’s meaning that Ui ^ is non-invertible by 
Lemma 144 . (ii) U21 = 0 (m _ S ) Xs . Thus from (i) and (ii) and from Lemma 146 , U is not invertible, 
which is a contradiction. □ 

155 Corollary Let A 6 M mxn (F), B 6 M nxp (F). If A is invertible then rank (AB) = rank (B). If B is 
invertible then rank (AB) = rank (A). 

Proof: Using Theorem 1 54 , if A is invertible 

rank (AB) < rank (B) = rank (A 1 AB) < rank (AB) , 

and so rank (B) = rank (AB). A similar argument works when B is invertible. 

□ 


156 Example Study the various possibilities for the rank of the matrix 


A = 


1 1 1 

b+c c+a a+b 
be ca ab 


Solution: ► Performing R2 — (b + c)Rt — > R2 and R3 — bcRi — > R3, we find 

1 1 1 

0 a — b a — c 

0 0 (b — c) (a — c) 

Performing C2 — Ci — > C2 and C3 Ci — ; C3, we find 

1 0 0 

0 a — b a — c 

0 0 (b — c) (a — c) 

We now examine the various ways of getting rows consisting only of 0 ’s. If a = b = c, the last 
two rows are 0 -rows and so rank (A) = 1 . If exactly two of a, b, c are equal, the last row is a 
0 -row, but the middle one is not, and so rank (A) = 2 in this case. If none of a, b, c are equal, 
then the rank is clearly 3 . ◄ 
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Homework 

Problem 2.7.1 On a symmetric matrix A G M nX n(R) with n > 3, 

R3 — 3 Ri — > R3 


successively followed by 


C3 — 3Ci —r C3 

are performed. Is the resulting matrix still symmetric? 


Problem 2.7.2 Find the rank of 


a + 1 

a + 2 
a + 3 
a + 4 

Problem 2.7.3 Let A, B be arbitrary n 


a + 2 

a + 3 

a + 

a + 3 

a + 4 

a + 

a + 4 

a + 5 

a + 

a + 5 

a + 6 

a + 


X n matrices over R. 


4 a + 5 

5 a + 6 

6 a + 7 

7 a + 8 


e Msxst 


Prove or disprove! rank (AB) 


rank (BA) . 


Problem 2.7.4 Determine the rank of the matrix 


1 

0 

2 

2 


1 0 
0 1 
2 2 
0 0 


0 

1 

2 

2 


Problem 2.7.5 Suppose that the matrix 


4 2 


G M 2 X 2 (R) has rank 1 . How many possible values can x assume? 


Problem 2.7.6 Demonstrate that a non-zero n x n matrix A over a field F has rank 1 if and only if A can be factored 
as A = XY, where X g M nx i (F) and Y g Mi X n(F). 


Problem 2.7.7 Study the various possibilities for the rank of the matrix 


1 a 1 b 

a 1 b 1 

1 b 1 a 

b 1 a 1 


when (a, b) g R 2 . 
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1-10 1 


Problem 2.7.8 Find the rank of 


m 


1 


1 -1 -1 

-m 1 0 


as a function of m g C. 


1 -1 m 2 


Problem 2.7.9 Determine the rank of the matrix 


Problem 2.7.10 Determine the rank of the matrix 


a 2 

ab 

ab 

b 2 

ab 

a 2 

b 2 

ab 

ab 

b 2 

a 2 

ab 

b 2 

ab 

ab 

a 2 

1 

1 

1 

1 

a 

b 

a 

b 

c 

c 

d 

d 

ac 

be 

ad 

bd 


1 1 2 


Problem 2.7.11 Let A g M 3 X 2 (R). B g M 2 X 2 (R), and C g M 2 X 3 (R) be such that ABC 


-2 x 


1 . Find x. 


1 -2 1 


Problem 2.7.12 Let B be the matrix obtained by adjoining a row (or column) to a matrix A. Prove that either 

rank (B ) = rank (A) or rank (B) = rank (A) + 1 . 


Problem 2.7.13 Let A g M n x n ( R ) . Prove that rank (A) = rank (AA T ). Find a counterexample in the case A g 
M nxn (C). 


Problem 2.7.14 Prove that the rank of a skew- symmetric matrix with real number entries Is an even number. 


2.8 Rank and Invertibility 

157 Theorem A matrix A g M mxn (F) is left-invertible if and only if rank (A) = n. A matrix A g M mxn (F) 
is right-invertible if and only if rank (A) = m. 

Proof: Observe that we always have rank (A) < n. If A is left invertible, then 3L g M nxm (F) 

such that LA = I n . By Theorem 1 54, 

n = rank (I n ) = rank (LA) < rank (A) , 

whence the two inequalities give rank (A) = n. 


Conversely, assume that rank (A) = n. Then rank (A T ) = n by Corollary 150, and so by 
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Theorem 149 there exist P E GL m (F), Q E GL n (F), such that 


PAQ = 


*(m— n) xn 


, q t a t p t = 


Onx(m-n) 


This gives 


Q t A t P t PAQ = I n A T P T PA = (Q T ) 7 Q 1 


CfQ 7 )- 1 Q- 


A t P t PA = I n , 


and so ( ( Q T ) 1 Q 1 ) 1 A T P T P is a left inverse for A. 


The right-invertibility case is argued similarly. □ 


By combining Theorem 157 and Theorem 124, the following corollary is thus immediate. 


158 Corollary If A 6 M mxn (F) possesses a left inverse L and a right inverse R then m = n and L = R. 


We use GauJ3-Jordan Reduction to find the inverse of A E GL n (F). We form the augmented matrix 
T = [A|I n ] which is obtained by putting A side by side with the identity matrix I n . We perform permissible 
row operations on T until instead of A we obtain I n , which will appear if the matrix is invertible. The 
matrix on the right will be A -1 . We finish with [I n |A -1 ]. 


If A E M nxn (R) is non-invertible, then the left hand side in the procedure above will not 
reduce to I n . 


159 Example Find the inverse of the matrix B E M 3 X 3 (Z 7 ), 


6 0 1 


B = 


3 


2 


0 • 


1 


0 


1 
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Solution: ► We have 


6 

0 

1 

1 

0 

0 


1 

0 

1 

0 

0 

1 

3 

2 

0 

0 

1 

0 

R i f-> R3 
* — > 

3 

2 

0 

0 

1 

0 

1 

0 

1 

0 

0 

1 


6 

0 

1 

1 

0 

0 








1 

0 

1 

0 

0 

1 







R3— 6R1 -A R 3 
R2 — 3 R 1 — > R 2 

0 

2 

4 

0 

1 

4 








0 

0 

2 

1 

0 

1 








5 

0 

0 

1 

0 

6 







R2— 2R3— >R2 
5 R i + R 3 — > R 1 

0 

2 

0 

5 

1 

2 








0 

0 

2 

1 

0 

1 








1 

0 

0 

3 

0 

4 







3 R i — > R i ; 4R3-4R3 

•N'A 

4 R 2 — > R2 

0 

1 

0 

6 

4 

1 








0 

0 

1 

4 

0 

4 


We conclude that 


-i 


' |i— 

lo 


3 0 4 

3 2 0 

= 

6 4 T 

T 0 T 


4 0 4 


◄ 


160 Example Use GauJ3-Jordan reduction to find the inverse of the matrix A 


0 1 -1 


4-3 4 


. Also, 


3-3 4 


find A 2001 . 
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Solution: ► Operating on the augmented matrix 


0 1 

-1 

1 

0 

0 


0 

1 

-1 

1 

0 

0 

4 -3 

4 

0 

1 

0 

R 2 — R 3 — > R 2 

1 

0 

0 

0 

1 

-1 

3 -3 

4 

0 

0 

1 


3 

-3 

4 

0 

0 

1 







0 

1 

-1 

1 

0 

c 






R 3 — 3 R 2 — > R 3 

1 

0 

0 

0 

1 

— 







0 

-3 

4 

0 

-3 

4 


R3 + 3 Ri — > R3 


Ri +R 3 ^Ri 


R i <— > R 2 


0 i -i 

1 0 0 
0 0 i 

0 i 0 

1 0 0 

0 0 1 

1 0 0 
0 1 0 
0 0 1 


1 0 0 

0 1 -1 

3-34 

4-34 
0 1 -1 

3 - 34 

0 1 -1 

4- 34 
3-34 


Thus lue deduce that 


A - 1 = 


0 1 -1 
4-3 4 

3-3 4 


= A. 


From A 1 = A we deduce A 2 = I n . Hence A 2000 = (A 2 ) 1000 = I ^ 000 = l n and A 2001 = A(A 2000 ) 

AI n = A. ◄ 

161 Example Find the inverse of the triangular matrix A £ M nxn (K), 

1 1 1 ••• 1 

0 1 1 ••• 1 

A = 0 0 1 ••• 1 


0 0 0 


Rank and Invertibility 
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Solution: ► Form the augmented matrix: 


1 1 1 
0 1 1 
0 0 1 


1 1 0 0 ••• 0 

1 0 1 0 ••• 0 

1 0 0 1 ••• 0 . 


0 0 0 


1 0 0 0 ••• 1 


and perform R k — Rk+i — > Rk successively for k = 1,2, — 1, obtaining 


1 

0 

0 •• 

• 0 

1 

-1 

0 • 

• 0 

0 

1 

0 •• 

• 0 

0 

1 

-1 • 

• 0 

0 

0 

1 •• 

• 0 

0 

0 

1 • 

• 0 

0 

0 

0 •• 

• 1 

0 

0 

0 • 

• 1 


whence 


A- 1 


1 -1 0 

0 1 -1 

0 0 1 


0 

0 

0 - 


00 0 ••• 1 

that is, the inverse of A has 1 ’s on the diagonal and —1 ’s on the superdiagonal. ◄ 

162 Theorem Let A g M nxn (F) be a triangular matrix such that cp i an • ■ • a nn f Of- Then A is invertible. 

Proof: Since the entry cikk f Of we multiply the k -th row by a kk and then proceed to subtract 

the appropriate multiples of the preceding k — 1 rows at each stage. □ 

163 Example (Putnam Exam, 1969) Let A and B be matrices of size 3x2 and 2x3 respectively. Suppose 
that their product AB is given by 


8 


2 


-2 


AB = 


2 5 4 


-2 4 5 
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Demonstrate that the product BA is given by 


BA = 


9 0 
0 9 


Solution: ► Observe that 

8 2-2 
(AB) 2 =254 
-2 4 5 


8 2-2 
2 5 4 

-2 4 5 


72 

18 

-18 

18 

45 

36 

-18 

36 

45 


= 9AB. 


Performing R 3 + R 2 — > R 3 , Ri — 4 R 2 — > R 2 , and 2 R 3 + Ri — > R 3 in succession we see that 


8 

2 

-2 


0 

-18 

-18 


0 

-18 

0 


0 

-18 

0 

2 

5 

4 


2 

5 

4 

* — > 

2 

5 

-1 

~ — > 

0 

5 

-1 

-2 

4 

5 


0 

0 

0 


0 

0 

0 


0 

0 

0 


and so rank (AB) = 2. This entails that rank ((AB) 2 ) = 2. Now, since BA is a 2x2 matrix, 
rank (BA) < 2. Also 

2 = rank ((AB) 2 ) = rank (ABAB) < rank (ABA) < rank (BA) , 
and we must conclude that rank (BA) = 2. This means that BA is invertible and so 


◄ 


(AB) 2 = 9AB 


=4 A(BA-9I 2 )B = 0 3 
=4 BA(BA-9I 2 )BA = B0 3 A 
=4 BA(BA — 9I 2 )BA = 0 2 
=4 (BA)- 1 BA(BA-9I 2 )BA(BA)- 1 

=4 BA — 9I 2 = 0 2 


(BA)- 1 0 2 (BA)- 1 


Homework 

Problem 2.8.1 Find the inverse of the matrix 

T 2 3 

2 3 T 

3 T 2 


€ M3 X 3 (Z 7 ). 
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Problem 2.8.2 Let (A, B ) £ M 3 X 3 (R) be given by 


A = 


a 

b 

c 


0 

0 

-1 

1 

0 

0 

, B = 

0 

-1 

a 

0 

1 

0 


-1 

a 

b 


Find B 1 and prove that A T = BAB 1 . 


Problem 2.8.3 Let A 


1 0 0 

1 1 0 

1 1 x 


where x ^ 0 is a real number. Find A 1 . 


Problem 2.8.4 If the inverse of the matrix M = 


1 

0 

1 


0 

-1 

0 

-1 

0 

0 

is the matrix M 1 = 

-1 

-1 

a 

0 

1 

1 


1 

1 

b 


, find (a, b). 


Problem 2.8.5 Let A 


1 0 0 
1 1 0 
1 1 1 


and let n > 0 be an integer. Find (A 71 


Problem 2.8.6 Give an example of a 2 x 2 invertible matrix A over R such that A + A 1 is the zero matrix. 
Problem 2.8.7 Find all the values of the parameter a for which the matrix B given below is not invertible. 


B = 


-1 a + 2 2 

0 a 1 

2 1 


a 


Problem 2.8.8 Find the inverse of the triangular matrix 

a 2a 3a 
0 b 2b 
0 0c 


e M3 x3( 


assuming that abc y" 0. 
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Problem 2.8.9 Under what conditions is the matrix 

b a 0 
c 0 a 
0 c b 

invertible? Find the inverse under these conditions. 

Problem 2.8.10 Let A and B be n x n matrices over a field F such that AB is invertible. Prove that both A and B 
must be invertible. 

Problem 2.8.11 Find the inverse of the matrix 

1 + a 1 1 

1 1 +b 1 

1 1 1 + c 


Problem 2.8.12 Prove that for the n x n (n > 1) matrix 


r 




-1 





0 

1 

1 . 

. 1 


2 — n 

1 

1 

1 

1 

0 

1 . 

. 1 


1 

2 — n 

1 

1 

1 

1 

0 . 

. 1 

1 

n- 1 

1 

1 

2 — n . 

1 

1 

1 

1 . 

. 0 


1 

1 

1 

. 2 — n 


Problem 2.8.13 Prove that the n x n (n > 1) matrix 

1 + a 1 1 

1 1 + a 1 

1 1 1 + a 


1 

1 

1 


has inverse 


1 


1 + a 


1 — n — a 


1 


1 1 -n- a 1 


1 

a(n + a) 


1 — n — a 


1 

1 


1 


1 


1 


1 


n — a 
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Problem 2.8.14 Prove that 


(2n — 1 ' 


5 7 
3 5 


(2n — 3) (2n- 1) 1 3 


7 9 


(2n-l) 
(2n - 3) 
(2n - 5) 


has inverse 


2 - n 2 2 + n 2 


1 

2n 3 


2 

2 2 

2 + n 2 2 

Problem 2.8.15 Prove that the n x n (n > 1) matrix 


2 - n 2 2 + n 2 


2 - n 2 2 + n 2 


2 

2 

2 

2 — n 2 


1 + cn 1 1 

1 1 + a 2 1 

1 1 1 + a 3 


1 

1 

1 

1 + On 


has inverse 


1 — ai s 


1 


1 


1 


1 


a 2 ai 

1 


<h a 2 
1 - a 2 s 
a? 


a 3 ai a 3 a 2 


ai a 3 
1 

a 2 a 3 
1 — a 3 s 
a? 


ai a n 

1 

a 2 a n 

1 

a 3 am 


1 - a n s 


l a„ai a n a 2 a n a 3 


where s = 1 + — — h — 
a 1 a 2 


Problem 2.8.16 Let A G Ms x s (K). Shew that if rank (A 2 ) < 5, then rank (A) < 5. 

Problem 2.8.17 Let p be an odd prime. How many invertible 2x2 matrices are there with entries all in 7L V ? 


Problem 2.8.18 Let A, B be matrices of the same size. Prove that rank (A + B) < rank (A) + rank (B). 
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0 -1 -1 


Problem 2.8.19 Let A g M.; , 2 ( 1R ) and B g M 2 , 3 (K) be matrices such that AB 


-1 


0 


Prove that 


BA = I 2 . 


2 


Chapter 


3 


Linear Equations 


3.1 Definitions 

We can write a system of m linear equations in n variables over a field F 

ai 1X1 + a 12 x 2 + ai 3 x 3 H b d ln x n = y 1; 

a 2 i%i + a 22 x 2 + a 23 x 3 H b a 2 n x n = y 2 , 


“b Cirri .2^2 “b ^m3^3 T * • • “b d m n^n — firri> 


in matrix form as 


flu cq 2 ••• din 


*1 


yi 

d 21 a 22 • • • a 2n 


x 2 

= 

y2 

dml dm2 * ' * dmn 


*n 


V m 


We write the above matrix relation in the abbreviated form 


(3.1) 


AX = Y, 


(3.2) 


where A is the matrix of coefficients, X is the matrix of variables and Y is the matrix of constants. Most 
often we will dispense with the matrix of variables X and will simply write the augmented matrix of the 
system as 


- 



- 

an 

dl2 

’ ai n 

y^ 

0-21 

d22 

* a2n 

V2 

ami 

dm2 * 

* a mn 

y m 


(3.3) 


164 Definition Let AX = Y be as in 3. 1. If Y = O m xi , then the system is called homogeneous, otherwise it 
is called inhomogeneous. The set 

{X £ M nx i (F) : AX = 0 mx i} 
is called the kernel or nullspace of A and it is denoted by ker ( A) . 
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Observe that we always have O nxl G ker (A) 6 M mxn (F). 

165 Definition A system of linear equations is consistent if it has a solution. If the system does not have 
a solution then we say that it is inconsistent. 


166 Definition If a row of a matrix is non-zero, we call the first non-zero entry of this row a pivot for this 
row. 

167 Definition A matrix M G M mxn (F) is a row-echelon matrix if 

• All the zero rows of M, if any, are at the bottom of M. 

• For any two consecutive rows R t and R l+ i , either R t+ i is all Op’s or the pivot of R i+ i is immediately 
to the right of the pivot of R t . 

The variables accompanying these pivots are called the leading variables. Those variables which are not 
leading variables are the free parameters. 

168 Example The matrices 


^ 1 

(( )) 

o 

1 


CD ° 1 1 

0 0 (Q) 2 


o 

o 

o 

O 

o 

o 

> 

0 0 0 0 

0 0 0 0 


0 0 0 0 


are in row-echelon form, with the pivots circled, but the matrices 


i 

0 

1 

1 


1 

0 

1 

1 

0 

0 

1 

2 


0 

0 

0 

0 

0 

0 

1 

1 

) 

0 

0 

0 

1 

0 

0 

0 

0 


0 

0 

0 

0 


are not in row-echelon form. 

I® 3 Observe that given a matrix A € M mxn (F), by following GauB-Jordan reduction a la Theo- 
rem 149, we can find a matrix P G GL m (F) such that PA = B is in row-echelon form. 

169 Example Solve the system of linear equations 


1111 


X 


-3 

0 2 1 0 


y 


-1 

T 

o 

o 


z 


4 

0 0 0 2 


w 


-6 


Definitions 
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Solution: ► Observe that the matrix of coefficients is already in row-echelon form. Clearly every 
variable is a leading variable, and by back substitution 


2w = —6 


w = = —3, 

2 


z — w = 4 =4> z = 4 + w = 4 — 3 = 1, 

1 1 

2y + z = — 1 =4 y = — - — -z = — 1 , 


x + y + z - 

The (unique) solution is thus 


w = _3 x = 


y 


z — w = 0. 


■ 


■ 

X 


0 

V 



z 


i 

XV 


-3 


◄ 


1 70 Example Solve the system of linear equations 


1 1 1 

0 2 1 
0 0 1 


1 


X 


-3 

0 


y 

= 

-1 

-1 


z 


4 



XV 




Solution: ► The system is already in row-echelon form, and we see that x,y,z are leading 
variables while w is a free parameter. We put w = t. Using back substitution, and operating 
from the bottom up, we find 


2y + z 


x + y+ z + w = —3 
The solution is thus 


z — w = 4 =4> z = 4 + w = 4 + t, 


y 



5 1 9 3 

x = —3 — y — z — w = —3 H 1 — t — 4 — t — t = 1. 

2 2 2 2 


X 


— — - It 
2 2 L 

y 


— 2 ^ l t 
2 2 L 

z 


4 + t 

XV 


t 


◄ 
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171 Example Solve the system of linear equations 



X 



1111 


y 


-3 

0 2 10 


z 


-1 


w 




Solution: ► We see that x,y are leading variables, while z,w are free parameters. We put 
z = s, w = t. Operating from the bottom up, we find 


1 1 

2y + z = — 1 y = z 

2 2 


1 1 


:S, 


5 3 

x + y+ z + w = —3 =£> x = —3 — y — z — w s t. 

2 2 


The solution is thus 


X 


_5 _ 3 t 

2 2 s 1 



1 1 „ 

y 

= 

2 2 S 

z 


S 

w 


t 


, (s, t) 6 


1 72 Example Find all the solutions of the system 

x + 2y + 2z = 0, 


working in Z 3 . 


y + 2z = 1 , 


Solution: ► The augmented matrix of the sgstem is 


" 

- 

T 2 2 

0 

0 T 2 

T 


The sgstem is already in row-echelon form and x, y are leading variables while z is a free 
parameter. We find 

y = 1 — 2z = 1 + lz, 

and 

x = — 2y — 2z = 1 + 2z. 

Thus 


X 


1 +2z 

y 

= 

1 + 1z 

z 


z 


, zez 3 . 


Definitions 


69 


Letting z = 0, 1 , 2 successively, we find the three solutions 


and 


◄ 


X 


T 

y 

= 

T 

z 


0 

X 


0 

y 

= 

2 

z 


T 

X 


2 

y 

= 

0 

z 


2 


Homework 


Problem 3.1.1 Find all the solutions In Z 3 of the system 

x + y+ z + w = 0, 

2y + w = 2. 

Problem 3.1.2 In Z7, given that 


Problem 3.1.5 This problem Introduces Hill block ci- 
phers, which are a way of encoding information with an 
encoding matrix A £ M nX n(^ 2 s), where n is a strictly 
positive integer. Split a plaintext into blocks of n letters, 
creating a series of n x 1 matrices Pk, and consider the 
numerical equivalent (A = 0, B = 1 , C = 2, . . . , Z = 25) 
of each letter. The encoded message is the translation to 
letters of the n x 1 matrices Ck = APk mod 26. 


-1 


12 3 


4 2 0 

2 3 T 

= 

2 0 4 

3 T 2 


0 4 2 


For example, suppose you want to encode the mes- 
sage COMMUNISTS EAT OFFAL with the encoding ma- 
trix 

0 1 0 


A = 


3 


0 0 . 


find all solutions of the system 

lx + 2y + 3z = 5; 

2x + 3y + 1 z = 6; 
3x + ly + 2z = 0. 


0 0 2 

a 3 x 3 matrix. First, split the plaintext into groups of 
three letters: 

COM MUN 1ST SEA TOF FAL. 


Problem 3.1.3 Solve in Z 13 : 


Form 3x1 matrices with each set of letters and find their 
numerical equivalent, for example, 


x — 2y + z = 5, 2x + 2y = 7, 5x — 3y + 4z = 1 . 

Problem 3.1.4 Find, with proof, a polynomial p(x) with 
real number coefficients and degree 3 such that 


C 


2 

O 

= 

14 

M 


12 


P(— 1) “—10, p(0) = —1 , p(l ) =2, p(2) =23. 
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Find the product APi modulo 26, and translate Into let- 
ters: 


system 


1 

o 

o 

1 


2 


14 


O 

3 0 0 


14 

= 

6 

= 

G 

0 0 2 


12 


24 


Y 


xo + xi = 0, 

Xo + X2 = 1 , 

Xo + X 3 = 2, 


hence COM Is encoded Into OGY. Your task Is to com- 
plete the encoding of the message. 


Problem 3.1.6 Find all solutions in Z103, if any, to the 


Xo +xioo = 99, 

Xo + XT + x 2 + • • • + Xioo = 4949. 

Hints: 0 + 1 + 2 h h 99 = 4950, 99- 77-103-74 = 1. 


3.2 Existence of Solutions 

We now answer the question of deciding when a system of linear equations is solvable. 

173 Lemma Let A E M mxn (F) be in row-echelon form, and let X E M nx i (F) be a matrix of variables. 
The homogeneous system AX = O mx i of m linear equations in n variables has (i) a unique solution if 
m = n, (ii) multiple solutions if m < n. 


Proof: If m = n then A is a square triangular matrix whose diagonal elements are different 

from Of- As such, it is invertible by virtue of Theorem 162. Thus 

AX = O nx i ==> X = A _ 1 O nx i = O nx i 

so there is only the unique solution X = O nx i , called the trivial solution. 


If m < n then there are n — m free variables. Letting these variables run through the elements of 
the field, we obtain multiple solutions. Thus if the field has infinitely many elements, we obtain 
infinitely many solutions, and if the field has k elements, we obtain k n m solutions. Observe 
that in this case there is always a non trivial solution. 

□ 

1 74 Theorem Let A E M mxn (F), and let X E M nx i (F) be a matrix of variables. The homogeneous system 
AX = O mx i of m linear equations in n variables always has a non-trivial solution if m < n. 


Proof: We can find a matrix P E GL m (F) such that B = PA is in row-echelon form. Now 

AX = O mx i +=+ PAX = O mx i <+=+ BX = O mx i. 

That is, the systems AX = O mx -\ and BX = O mx i have the same set of solutions. But by Lemma 
1 73 there is a non trivial solution. □ 

1 75 Theorem (Kronecker-Capelli) Let A E M mxn (F), Y e M mx i (F) be constant matrices and X E M nx i (F) 
be a matrix of variables. The matrix equation AX = Y is solvable if and only if 


rank (A) = rank f[A|Y]) . 
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Proof: Assume first that AX = Y, 





X 


n 


Let the columns of [A|X] be denoted by Ct, 1 < i < n. Observe that that [A|X] G M mx(n+1 ) (F) 
and that the (n + 1 )-th column of[A\X] is 


xt an +x 2 a 12 H bx n a ln 


Cn+1 = AX = 


a 2 i + x 2 a 22 H + x n a 2n 


= Y XjCj. 
i=i 


XT a^i + x 2 a n2 + • ’ • + x n a nn 


By performing C n+ T— H]1t XjCj — > C n+ i on [A|Y] = [A|AX] we obtain [A|O n xi]- Thus rank ( [A | Y] ) = 
rank ([A|O nx i]) = rank (A). 


Now assume that r = rank (A) = rank ([A|Y]). This means that adding an extra column to A 
does not change the rank, and hence, by a sequence column operations [A|Y] is equivalent to 
[A ! O nx i]. Observe that none of these operations is a permutation of the columns, since the first 
n columns of [A\Y] and [A|0n X i ] are the same. This means that Y can be obtained from the 
columns Q , 1 < i < n of A by means of transvections and dilatations. But then 

n 

Y=^x t C t . 

i=i 


The solutions sought is thus 


XT 



X 


n 


□ 


Problem 3.2.1 Let A G M n xp(F), B G M nxq (F) and 
put C = [A B] g M nx(p+ql (F) Prove that rank (A) = 


rank ( C ) 


3P g M p (q) such that B = AP. 


3.3 Examples of Linear Systems 

1 76 Example Use row reduction to solve the system 


+ 

2y 

+ 

3z 

+ 

4 w = 

8 

+ 

2y 

+ 

4 z 

+ 

7w = 

12 


2x + 4y 


6z + 8w 


16 
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Solution: ► Form the expanded matrix of coefficients and apply row operations to obtain 


- 










- 

1 

2 

3 

4 

8 

R3 — 2 Ri — » R3 

1 

2 

3 

4 

8 

1 

2 

4 

7 

12 

~ — > 

R2 — Ri — >R2 

0 

0 

1 

3 

4 

2 

4 

6 

8 

16 


0 

0 

0 

0 

0 


The matrix is now in row-echelon form. The variables x and z are the pivots, so w and y are 
free. Setting w = s , y = t we have 

z = 4 — 3s, 

x = 8 — 4w — 3z — 2y = 8 — 4s — 3(4 — 3s) — 2t = —4 + 5s — 2t. 

Hence the solution is given by 


X 


—4 + 5s — 2t 

y 


t 

z 


4 -3s 

w 


s 


177 Example Find agl such that the system 

x + y — z = 1, 

2x + 3y + az = 3, 
x + ay + 3z = 2, 

posses (i) no solution, (ii) infinitely many solutions, (iii) a unique solution. 
Solution: ► The augmented matrix of the system is 


- 



- 

1 

1 

-1 

1 

2 

3 

a 

3 

1 

a 

3 

2 


By performing R2 — 2 Rt — » R2 and R3 Ri — > R3 we obtain 





- 


1 1 

-1 

1 

■W 

0 1 

oc - h 2 

i 


0 a- 1 

4 

i 


By performing R3 — ( a — 1 ) R2 — » R3 on this last matrix we obtain 






- 


1 

1 

-1 

1 


0 

i 

a + 2 

1 


0 

0 

(a — 2)(a + 3) 

a — 2 
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If a = —3, we obtain no solution. If a = 2. there is an infinity of solutions 


X 


5t 

y 

= 

1 -4t 

z 


t 


t £ M. 


If a =f 2 and a f 3, there is a unique solution 


r _ 


■ 

X 


i 

y 


i 


a + 3 

1 



.a + 3. 


◄ 

1 78 Example Solve the system 


6 0 T 


X 


T 

3 2 0 


y 

= 

0 

T 0 T 


z 


2 


for (x,y,z) £ (Z 7 ) 3 . 

Solution: ► Performing operations on the augmented matrix we have 


6 

0 

T 

T 


T 

0 

T 

2 

3 

2 

0 

0 

RihRj 

3 

2 

0 

0 

T 

0 

T 

2 


6 

0 

T 

T 






T 

0 

T 

2 





R3 — 6Ri — > R 3 
R2 — 3 Ri — >R2 

0 

2 

4 

T 






0 

0 

2 

3 


This gives 

2z = 3 =^> z = 5, 

2y = T - 4z = 2 =*> y = T, 
x = 2 — z = 4. 

(x,y,z) = (4,T, 5). 


The solution is thus 


74 


Chapter 3 


Homework 


Problem 3.3.1 Find the general solution to the system 


11111 


a 


1 

o 

o 


b 


-1 

2 12 12 


c 

= 

0 

4 2 4 2 4 


d 


0 

1 

o 

o 

o 


f 


0 


or shew that there is no solution. 

Problem 3.3.2 Find all solutions of the system 


11111 


a 


3 

11112 


b 


4 

1113 3 


c 

= 

7 

114 4 4 


d 


6 

1 2 3 4 5 


f 


9 


if any. 

Problem 3.3.3 Study the system 

x + 2my + z = 4m; 

2mx + y + z = 2; 
x + y + 2mz = 2m 2 , 

with real parameter m. You must determine, with proof, for which m this system has (i) no solution, (ii) exactly one 
solution, and (iii) infinitely many solutions. 

Problem 3.3.4 Study the following system of linear equations with parameter a. 

(2a — 1 )x + ay — (a + 1 )z = 1 , 
ax + y — 2z = 1 , 

2x + (3 — a)y + (2a — 6)z = 1 . 

You must determine for which a there is: (i) no solution, (ii) a unique solution, (iii) infinitely many solutions. 

Problem 3.3.5 Determine the values of the parameter m for which the system 

x + y + (1 — m)z = m + 2 

(1 + m)x — y + 2z =0 

2x — my + 3z = m + 2 


is solvable. 
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Problem 3.3.6 Determine the values of the parameter m for which the system 


is solvable. 


y 

y 

y 

y 


4a 

4b 

4c 

4d 


Problem 3.3.7 It is known that the system 

ay + bx = c; 
cx + az = b; 
bz + cy = a 

possesses a unique solution. What conditions must (a, b, c) g R 3 fulfill in this case? Find this unique solution. 


Problem 3.3.8 For which values of the real parameter a does the following system have (i) no solutions, (ii) exactly 
one solution, (iii) infinitely many solutions? 


a)x 

+ 

( 2 a + 1 )y + 

( 2 a + 2 )z = 

a, 

ax 

+ 

ay 

= 

2 a + 2 , 

2 x 

+ 

(a + 1 )y + 

(a — 1 )z = 

a 2 — 2 a + 9. 


Problem 3.3.9 Find strictly positive real numbers x, y , z such that 


x 3 y 2 z 6 


x 4 y 5 z 12 


x 2 y 2 z 5 


1 

2 

3. 


Problem 3.3.10 (Leningrad Mathematical Olympiad, 1987, Grade 5) The numbers 1,2 16 are arranged in 

a 4 x 4 matrix A as shewn below. We may add 1 to all the numbers of any row or subtract 1 from all numbers of 
any column. Using only the allowed operations, how can we obtain A T ? 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 


Problem 3.3.11 (International Mathematics Olympiad, 1963) Find all solutions xi ,X 2 ,X 3 ,X 4 ,X 5 of the system 

X5 + X2 = yx! ; 

Xi + x 3 = yx 2 ; 
x 2 + x 4 = yx 3 ; 

X 3 + x 5 = yx 4 ; 
x 4 + xt = yx 5 , 


where y is a parameter. 


Chapter 


Vector Spaces 


4.1 Vector Spaces 

179 Definition A vector space (V, +,*,F) over a field (F, +,) is a non-empty set V whose elements are 
called vectors, possessing two operations + (vector addition) , and • (scalar multiplication) which satisfy 
the following axioms. 

V(lt,l),lt) € V 3 , V(a, (3) 6 F 2 , 

VS1 Closure under vector addition : 



^ +b £ V, 

(4.1) 

VS2 Closure under scalar multiplication 

act £ V, 

(4.2) 

VS3 Commutativity 

it + 1) = + it 

(4.3) 

VS4 Associativity 

(it + b ) +lt =lt + (b + it ) 

(4.4) 

VS5 Existence of an additive identity 

3^ £V:lt-|-~3 > =lt-|-^=lt 

(4.5) 

VS6 Existence of additive inverses 

3 — it £ V : it + (— it) = (—it) + lt =~H 

(4.6) 

VS7 Distributive Law 

a(lt + 1j) = act + al? 

(4.7) 

VS8 Distributive Law 

( a + (3)lt = act + pit 

(4.8) 

VS9 

l F lt = a 

(4.9) 

VS10 

(ap)lt = a(plt) 

(4.10) 
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180 Example If n is a positive integer, then (F n , +, •, F) is a vector space by defining 

(a) , a 2) . . . , a„) + , t> 2 , . . . ,b n ) = (cq + bi , a 2 + b 2 , . . . , a n + b n ), 

A(ai , a 2) . . . , a n ) = (Acq , Aa 2) . . . , Aa n ). 

In particular, (Z| , + , • , Z 2 ) is a vector space with only four elements and we have seen the two-dimensional 
and tridimensional spaces (R 2 , +, •, R) and (R 3 , +,•,!). 

181 Example (M mxn (F), +, •, F) is a vector space under matrix addition and scalar multiplication of 
matrices. 

182 Example If 

F[x] = (cio + a-ix + a 2 x + • • • + a n x n : a t G F, ngf}] 

denotes the set of polynomials with coefficients in a field (F,+,) then (F[x], +, •, F) is a vector space, 
under polynomial addition and scalar multiplication of a polynomial. 

183 Example If 

F n [x] = {cio + dix + d 2 x + • • • + dkX k : di 6 F, n G N, k < n} 

denotes the set of polynomials with coefficients in a field (F, +, ) and degree at most n, then (F n [x], +, •, F) 
is a vector space, under polynomial addition and scalar multiplication of a polynomial. 

184 Example Let k 6 N and let c k (M a;b| ) denote the set of k-fold continuously differentiable real-valued 
functions defined on the interval [a; b]. Then C k (R |a;bl ) is a vector space under addition of functions and 
multiplication of a function by a scalar. 

185 Example Let p g]1; +oo[. Consider the set of sequences (a n }“ =0 , d n 6 C, 

{ OO 

( a n}“ =0 : Y- a *l P < +0 ° 
n=0 

Then l p is a vector space by defining addition as termwise addition of sequences and scalar multiplica- 
tion as termwise multiplication: 



{a n }“ o + {bn}“ o = {(^ + b n )}“ 0 , 

AfdntoHAdnto, ACC. 

All the axioms of a vector space follow trivially from the fact that we are adding complex numbers, 
except that we must prove that in l p there is closure under addition and scalar multiplication. Since 
Ln=0 l a nl P < +°0 =$> Ln= 0 |Aa n | p < +oo closure under scalar multiplication follows easily. To prove 
closure under addition, observe that if z G C then \z\ G R + and so by the Minkowski Inequality Theorem 
405 we have 

(Ln=olan + b n | p ) VP < 

< 

This in turn implies that the series on the left in (4.11) converges, and so we may take the limit as 
N — > +00 obtaining 

oo \VP / oo \Vp / oo \VP 

Y_ I a n + bn| P I < ^|a n | p + ( Y_ |bn| P ) ■ (4.12) 

n=0 / \n=0 / \n=0 / 

Now (4.12) implies that the sum of two sequences in l p is also in l p , which demonstrates closure under 
addition. 


(Ln=0lan| P 

(Ln=olanl P ) 


VP 

VP 


(Ln 0 Ibnl' 

(ln=0 |bnl P 


1/P 


, VP 


(4.11) 
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186 Example The set 

V = {a + b Vl + cx/3 : (a, b, c) E Q 3 } 

with addition defined as 

(a + hVl + cV3) + (a' + b'V2 + c'V3) = (a+a') + {b + b')Vl + ( c + c')V3 , 

and scalar multiplication defined as 

A(a + b Vl + cV3) = (Aa) + [Xb)Vl + (Acjx/I, 
constitutes a vector space over Q. 


187 Theorem In any vector space (V, + , •, F) , 


Proof: We have 

Hence 

or 


V a E F, <x0 = S . 


<x0 = tx(~(f +^) = <x0 + aH. 


aO — aO = aO, 


= aCL 


proving the theorem. □ 

1 88 Theorem In any vector space (V, 


•,F>, 

V ^ 6 V, = 1?. 


Proof: We have 

Therefore 


= (Of + Of = Of"x^ + Ofx?. 
Of^ — OpA? = Of^, 


0 = Of^, 

proving the theorem. □ 

189 Theorem In any vector space (V, +, -.F), a E F, A? E V, 

ocrf = S =$• a = Op V . 

Proof: Assume that a / 0 .-. Then a possesses a multiplicative inverse a -1 such that a 1 a = 

1 F . Thus 

on? = 0 =V a -1 ocrf = a -1 0 . 

By Theorem 188, a 1 ~Q = . Hence 

a -1 aT* = 0 . 

Since by Axiom 4.9, we have a -1 ocv = 1 fv = ~v , and so we conclude that A? = O' . □ 

1 90 Theorem In any vector space ( V, + , • , F) , 

Vex E F, V ^6 V, (— a)V = a(— V) = -(a^). 


Vector Subspaces 
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Proof: We have 

whence 

that is 

Similarly, 

whence 

that is 

proving the theorem. □ 


OfV = (a+ (— a)Y$ = crv? + (— a)At, 
-(a^) + 0 F ^ = (— a)\^, 

— (aV) = (— a)V. 


0 = a(T? — T?) = aV + a(— Tt), 
— (aV) + = a(— ~v), 

— (a^) = a(— 


Homework 


Problem 4.1.1 Is R 2 with vector addition and scalar 
multiplication defined as 


■ 








' 

Xi 

+ 

yi 

= 

Xi +yi 

, A 

Xi 

= 

Axi 

X2 


V2 


X2 +V2 


X2 


0 


a vector space? 

Problem 4.1.2 Demonstrate that the commutativity ax- 
iom 4.3 is redundant. 

Problem 4.1.3 Let V = R + =]0;+oo[, the positive real 
numbers and F = R, the real numbers. Demonstrate that 


V is a vector space over F if vector addition is defined as 
a © b = ab, (a, b) g (R + ) 2 and scalar multiplication is 
defined asa®a = o“, (a, a) g (R, R + ) . 

Problem 4.1.4 Let C denote the complex numbers and 
R denote the real numbers. Is C a vector space over R 
under ordinary addition and multiplication? Is R a vector 
space over C? 

Problem 4.1.5 Construct a vector space with exactly 8 
elements. 

Problem 4.1.6 Construct a vector space with exactly 9 
elements. 


4.2 Vector Subspaces 

191 Definition Let (V, +, -,F) be a vector space. A non-empty subset U C V which is also a vector space 
under the inherited operations of V is called a vector subspace ofV. 

192 Example Trivially, Xi = (t) 1 and X 2 = V are vector subspaces of V. 

193 Theorem Let (V, + , *,F) be a vector space. Then U C V, U y 0 is a subspace of V if and only if 
Va € F and Vfit, it) E U 2 it is verified that 

~<t + at) 6 U. 


Proof: Observe that U inherits commutativity, associativity and the distributive laws from V. 

Thus a non-empty UC Visa vector subspace ofV if (i)\l is closed under scalar multiplication, 
that is, if a E F and v E U, then a v E U; (ii) U is closed under vector addition, that is, if 
( it , " V* ) e U 2 , then hi + V 6 11. Observe that (i) gives the existence of inverses in U, for take 
a = — 1 f and so v 6 II ==> —V E U. This coupled with (ii) gives the existence of the zero-vector, 
for 0 = V -V E U. Thus we need to prove that if a non-empty subset ofV satisfies the property 
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stated in the Theorem then it is closed under scalar multiplication and vector addition, and vice- 
versa, if a non-empty subset ofV is closed under scalar multiplication and vector addition, then 
it satisfies the property stated in the Theorem. But this is trivial. □ 

194 Example Shew that X = {A E M nxn (F) : tr (A) = Of} is a subspace of M nxn (F). 

Solution: ► Take A, B E X, a E R. Then 

tr (A + aB) = tr (A) + atr (B) = Of + cx(Of) = Op. 

Hence A + aB E X, meaning thatX is a subspace o/'M llxn (F). ◄ 

195 Example Let U E M nxn (F) be an arbitrary but fixed. Shew that 

^u = {A e M nxn (F) : AU = UA} 

is a subspace of M nxn (F). 

Solution: ► Take (A, B) E ( < ^u) 2 - Then AU = UA and BU = UB. Now 

(A + aB)U = AU + aBU = UA + aUB = U(A + aB), 

meaning that A + aB E T’u . Hence %u is a subspace o/'M nxn (F). T’u is called the commutator 
of U. ◄ 

1 96 Theorem Let X C V, Y C V be vector subspaces of a vector space ( V, + , • , F) . Then their intersection 
X n Y is also a vector subspace of V. 

Proof: Let a E F and ) E (X n Y) 2 . Then clearly ( c? , b ) E X and ( c} , b ) E Y. Since X is 

a vector subspace, cf + ab 6 X and since Y is a vector subspace, H + ab e Y. Thus 

1? + air e X n Y 

and so XnY is a vector subspace ofV by virtue of Theorem 1 93. □ 

We we will soon see that the only vector subspaces of (R 2 , + , • , R) are the set containing the 
zero-vector, any line through the origin, and R 2 itself. The only vector subspaces of (R 3 , + , •, IB.) 
are the set containing the zero-vector, any line through the origin, any plane containing the origin 
and R 3 itself. 


Homework 


Problem 4.2.1 Prove that 


' 


•> 


a 



b 

GR 4 :a — b — 3d = 0 


c 



d 



is a vector subspace of R 4 . 


Problem 4.2.2 Prove that 


' 


> 


a 



2a -3b 



5b 

: a, b G K 


a + 2b 



a 

> 


is a vector subspace of R 5 . 
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Problem 4.2.3 Let A G M mxn (F) be a fixed matrix. 
Demonstrate that 

S ={X G M„xi(F) : AX = O m x i } 

is a subspace of M n xi (F). 


© 


b 

0 


a,b G R 2 , ab = 0 \ C m3 


Problem 4.2.4 Prove that the set X C M n xn(F) of upper 
triangular matrices is a subspace of M n xn(F). 


© 


a b 
0 0 


: (a, b) G R 2 , cl + b“ = 0 S C M2x2 ( 


Problem 4.2.5 Prove that the set X C M nxn (F) of sym- 
metrlc matrices is a subspace of M nX n(F). 


Problem 4.2.6 Prove that the set X C M n xn(F) of skew- 
symmetric matrices is a subspace of M nX n(F). 


Problem 4.2.8 Let (V, +, •, F) be a vector space, and let 
111 C V and U2 C V be vector subspaces. Prove that if 
Ui U U2 is a vector subspace of V, then either Ui C U2 
or U2CU1. 


Problem 4.2.7 Prove that the following subsets are not 
subspaces of the given vector space. Here you must say 
which of the axioms for a vector space fall. 



( 


> 



a 


0 


b 

: cl, l) G K., ci + b 2 = 1 > 


V 

0 



Problem 4.2.9 Let V a vector space over a field F. If F is 
infinite, show that V is not the set-theoretic union of a 
finite number of proper subspaces. 

Problem 4.2.10 Give an example of a finite vector space 
V over a finite field F such that 

V = V, u V 2 U v 3l 

where the Vk are proper subspaces. 


4.3 Linear Independence 

197 Definition Let (Ai , A 2 , • • • , A n ) gF". Then the vectorial sum 

n 

X A ^i 

i=i 

is said to be a linear combination of the vectors Id 6 V, 1 < i < n. 


198 Example Any matrix 


a 


c 


b 

d 


£ M 2 x 2 ( R ) can be written as a linear combination of the matrices 


1 0 
0 0 


0 1 
0 0 


0 0 
1 0 


0 0 
0 1 


a b 


1 

0 


0 

1 


0 

0 


0 

0 


= a 



+ b 



+ C 



+ d 



c d 


0 

0 


0 

0 


1 

0 


0 

1 


199 Example Any polynomial of degree at most 2, say a + bx + cx 2 £ R 2 M can be written as a linear 
combination of 1 , x — 1 , and x 2 — x + 2, for 

a + bx + cx 2 = (a — c) (1 ) + (b + c) (x — 1 ) + c(x 2 — x + 2). 
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Generalising the notion of two parallel vectors, we have 

200 Definition The vectors 6 V, 1 < i < n, are linearly dependent or tied if 

n 

3(Ai , A 2 , • • • , A n ) 6 F n \ {0} such that Aj^j 

j-i 

that is, if there is a non-trivial linear combination of them adding to the zero vector. 

201 Definition The vectors 6 V, 1 < i < n, are linearly independent or free if they are not linearly 
dependent. That is, if (Ai , A 2 , • • • , A n ) 6 F n then 

n 

Aj"^j = 0 => At = A 2 = • • • = A n = Op. 
j=i 


A family of vectors is linearly independent if and only if the only linear combination of them 
giving the zero-vector is the trivial linear combination. 


202 Example 



1 


4 


7 

> 


2 

> 

5 

> 

8 



3 


6 


9 



is a tied family of vectors in K 3 , since 


1 


4 


7 


0 

2 

+ (-2) 

5 

+ (D 

8 

= 

0 

3 


6 


9 


0 


203 Example Let it, be linearly independent vectors in some vector space over a field F with charac- 
teristic different from 2. Shew that the two new vectors 3c = it — V and => it + V are also linearly 
independent. 


Solution: ► Assume that a(rt — V) + b(rt + V) = 0. Then 

(a + b)rt + (a — b)T? = ^. 

Since rf , v' are linearly independent, the above coefficients must be 0, that is, a + b = Of and 
a — b = Op. But this gives 2 a = 2b = Op, which implies a = b = 0-, if the characteristic of the field 
is not 2. This proves the linear independence ofu — V and it + v . ◄ 

204 Theorem Let A e M mxn (F). Then the columns of A are linearly independent if and only the only 
solution to the system AX = O m is the trivial solution. 

Proof: Let Aj , , A n be the columns of A. Since 


xi Ai + X 2 A 2 + • • • + x n A n = AX, 


the result follows. □ 


Linear Independence 
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205 Theorem Any family 

containing the zero-vector is linearly dependent. 

Proof: This follows at once by observing that 

If 0 + Of tii + 0 fV 2 + . . . , +0F^ttk = 0 

is a non-trivial linear combination of these vectors equalling the zero-vector. □ 


Homework 


Problem 4.3.1 Shew that 



forms a free family of vectors in R 3 . 


Problem 4.3.5 Let (am,^V 2,^V3,^V4) be a linearly inde- 
pendent family of vectors. Prove that the family 

{Vi + V 2 ,V 2 + V 3 , V 3 + V 4 ,V 4 + 3? "1 } 

is not linearly independent. 


Problem 4.3.6 Let {Vi ,^2, vs} be linearly independent 
vectors in R 5 . Are the vectors 

Vi = 3V1 + 2V2 + 4V3 , 


Problem 4.3.2 Prove that the set 



1 


1 


1 


1 

' 


1 

> 

1 

> 

-1 

> 

1 



1 


-1 


1 


0 



1 


-1 


-1 


1 



is a linearly independent set of vectors in R 4 and shew 


that X 


2 


can be written as a linear combination of 


1 


b 2 = Vi + 4V 2 + 2 V 3 , 
V 3 = 9V1 + 4V2 + 3V3, 
V 4 =Vi +2V2 +5V3, 

linearly independent? Prove or disprove! 


Problem 4.3.7 Is the family {1 , \/l\ linearly independent 
over Q? 


Problem 4.3.8 Is the family {1 , Vl} linearly independent 
over R? 

Problem 4.3.9 Consider the vector space 

V = (a + b \fl -)- cV3 : (a, b, c) g Q 3 }. 

1 . Shew that (1 , fl, y/3} are linearly independent 
over Q. 


1 

these vectors. 

Problem 4.3.3 Let (ii,V) g (R n ) 2 . Prove that V • V = 
| |V 1 1 1 |V 1 1 if and only if it and V are linearly dependent. 


2. Express 


1 


i - vi vn — 2 

as a linear combination of {1 , \/2, a/3}- 


Problem 4.3.4 Prove that 



is a linearly independent family over R. Write 
a linear combination of these matrices. 


Problem 4.3.10 Let f , g, ti belong to C°° (R r ) (the space 
of infinitely continuously differentiable real-valued func- 
tions defined on the real line) and be given by 

f(x) = e x , g(x) = e 2x , h(x) = e 3x . 

Shew that f , g , ti are linearly independent over R. 

Problem 4.3.11 Let f , g, h belong to C°° (R r ) be given by 
f (x) = cos 2 x, g (x) = sin 2 x, h(x) = cos2x. 

Shew that f , g , ti are linearly dependent over R. 
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4.4 Spanning Sets 

206 Definition A family {iti , tt 2 > . . . , itk, . . . , } C V is said to span or generate V if every V 6 V can be 
written as a linear combination of the itj’s. 

207 Theorem If {iti , tt 2 , • • • ,t^k, . . . , } C V spans V, then any superset 


also spans V. 

Proof: This follows at once from 


1 1 

Y + Y. 

i=1 i=1 

□ 


208 Example The family of vectors 


f 

1 


0 


0 

\ 

— » 


— > 


~rf 



i = 

0 

, 1 = 

1 

, h = 

0 



0 


0 


1 



spans Hr since given 


we may write 


a 

b 


— » 

a i 


bj 


ct. 


c 


209 Example Prove that the family of vectors 



1 


1 


1 

\ 

H-X 

II 

0 

,t 2 - 

1 

,t 3 - 

1 



0 


0 


1 



spans K 3 . 
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Solution: ► This follows from the identity 


a 


1 


1 


1 

b 

= (a-b) 

0 

+ (b - c) 

1 

+ c 

1 

c 


0 


0 


1 




(b — c) 1 2 


cT,?- 


210 Example Since 


a b 


1 0 


0 

1 


0 0 


0 0 


= a 


+ b 



+ C 


+ d 


c d 


0 0 


0 

0 


1 0 


0 1 


the set of matrices 


1 0 
0 0 


0 1 
0 0 


0 0 
1 0 


0 0 
0 1 


is a spanning set for M 2 x 2 OK)- 


21 1 Example The set 


{l,x,x 2 ,x 3 


,xV..} 


spans R[x], the set of polynomials with real coefficients and indeterminate x. 


212 Definition The span of a family of vectors (iti , 1 ^ 2 , Ti k is the set of all finite linear combina- 
tions obtained from the Ut's. We denote the span of {rti ,^ 2 , . . . , Ti k , . . . , } by 

span (T^-i rt 2 ,.. • • 


213 Theorem Let (V, + , •, F) be a vector space. Then 

span (T^ , rt 2 , • • • ,^k, • • • , ) C V 


is a vector subspace of V. 

Proof: Let a E F and let 

1 1 

5? = Y cikd’k, 1 / = Y b k tt k , 

k=1 k=1 

be in span (iti ,^2, • ■ • , ri k , . . . , ) (some of the coefficients might be Of). Then 


5? + or? = ^(a k + ab k )it k € span (iti ,vt 2 , ■ ■ ■ ,^k, •••,), 

k=1 

and so span (T?i ,^ 2 , • • • ,^k, . . . , ) is a subspace ofV. □ 

214 Corollary span ( T?i , 1 ^ 2 , . . . ,lt k , . . . , ) C V is the smallest vector subspace of V (in the sense of set 
inclusion) containing the set 


Proof: IfW C V is a vector subspace ofV containing the set 

then it contains every finite linear combination of them, and hence, it contains span ( rf t , If 2 , • • • , d? k , • ■ • 

□ 
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215 Example If A e M 2X 2 (R). A e span 


I 0 


0 0 


0 1 


then A has the form 




vL° 

0 

0 1 

- 

1 

0 

/ 


1 0 


1 

O 

O 

1 


0 

1 


a c 

a 

0 0 

+ b 

0 1 

+ C 

1 

0 


c b 


i.e., this family spans the set of all symmetric 2x2 matrices over M. 

216 Theorem Let V be a vector space over a field F and let (V, v^) 6 V 2 , ygF \ {Op}- Then 

span ,v&) = span (T > ,yhi') . 


Proof: The equality 

proves the statement. □ 


a/v? + bv^ = cfv? + (by 1 ) (yr^) , 


217 Theorem Let V be a vector space over a field F and let (T?, vi’) 6 V 2 , y E F. Then 

span = span + yv$) . 


Proof: This follows from the equality 

cfv^ + Imi’ = apt? + yv$) + (b — ay)v^. 


□ 


Homework 


Problem 4.4.1 Let M3 [x] denote the set of polynomials with degree at most 3 and real coefficients. Prove that the 
set 

{i,i +x,ci +x) 2 ,n +x) 3 } 

spans M3 [x] . 



" 


/ 




\ 


1 


1 


0 

Problem 4.4.2 Shew that 

1 

M span 


0 

) 

1 



-1 


V 

-1 


-1 

/ 

/ 

1 0 

0 

0 

- 

0 

1 


Problem 4.4.3 What is span 


0 0 


0 1 


-1 0 


■J / 


Problem 4.4.4 Prove that 


span 


( 

1 0 


1 0 


0 1 


0 1 

\ 

V 

0 1 

> 

0 -1 

> 

1 0 

} 

-1 0 

/ 


= M 2 X 2 ( 
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Problem 4.4.5 For the vectors in ] 


prove that 



1 


1 


1 


3 

~ct = 

2 

, = 

3 


1 


8 


1 


2 


0 


5 


span 



= span 



,11k, • • •} Q V is said to be a basis of V if (i) they are linearly indepen- 

Of 

Of 

e i = Ip > 

Of 

Of 

where there is a 1 F on the i-th slot and Of’s on the other n — 1 positions, is a basis for F n . 

220 Theorem Let (V, + , •, F) be a vector space and let 

11 = 111! ,1^2) • • ■ , t^k, . . .} C V 

be a family of linearly independent vectors in V which is maximal in the sense that if U' is any other 
family of vectors of V properly containing U then 11' is a dependent family. Then U forms a basis for V. 

Proof: Since U is a linearly independent family, we need only to prove that it spans V. Take 

V 6 V. If V 6 U then there is nothing to prove, so assume that V e V \ U. Consider the set 
U' = Uu{l}. This set properly contains U. and so, by assumption, it forms a dependent family. 
There exists scalars <xo , ai , . . . , d n such that 

doV + ailli + • • • + d n ll n = S . 

Now, ao f Of, otherwise thefli would be linearly dependent. Hence a 0 1 exists and we have 

T? = — do 1 (dl ill H f CX n lln), 

and so the tl; span V. □ 

From Theorem 220 it follows that to shew that a vector space has a basis it is enough to 
shew that it has a maximal linearly independent set of vectors. Such a proof requires something 
called Zorn’s Lemma, and it is beyond our scope. We dodge the whole business by taking as an 
axiom that every vector space possesses a basis. 


4.5 Bases 

218 Definition A family (lli ,1^2, ■■ ■ 
dent, (ii) they span V. 

219 Example The family 
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221 Theorem (Steinitz Replacement Theorem) Let (V, +, •, F) be a vector space and let U = {iti ,T t 2 , . . .} C 
V. Let W = {v^i ,v^ 2 , • • • , vt’k 1 be an independent family of vectors in span (U). Then there exist k of the 
Ifi’s, say (iti , it 2 , • • • , hi k j which may be replaced by the w,’s in such a way that 

span (wt , w 2 , . . . , v^ k ,ht k+1 ,...)= span (U) . 


Proof: We prove this by induction on k. If k = 1 , then 

= op Tt ! + a 2 it 2 H + 

for some n and scalars <x\. There is an oa f CL, since otherwise contrary to the 

assumption that thewi are linearly independent. After reordering, we may assume that «i =/ O v. 
Hence 

I?! = af 1 (v^! — (a 2 tt 2 H h Onht n )), 

and sot?! G span (v^i , it 2 , . . . , ) and 

span ,\t 2 , ...,)= s P an C^i .^2, 


Assume now that the theorem is true for any set of fewer than k independent vectors. We may 
thus assume that that {it i , . . .} has more than k — 1 vectors and that 

span (v^t , v& 2 , ■ ■ ■ .v^k-i ,ht k , ,...)= span (U) . 

Since vV k G U we have 

= Pi v^l + 02^2 + • • • + pk-l^k-1 + Ykhtk + Yk+lhtk+1 + Ymhtm- 

If all the yi = Of, then the {v$i , vV 2 , . . . , v^ k ) would be linearly dependent, contrary to assumption. 

Thus there is a Yi ^ Of, and after reordering, we may assume that y k -fO.. We have therefore 

lt k = Yk’^k — (PlW! + p 2 w 2 H b Pk-lWk-1 + Yk+ltt k+1 + Ymlt m )). 

But this means that 

span (wt ,w 2 , . . . ,v^ k ,ht k+1 ,...) = span (U) . 

This finishes the proof . □ 

222 Corollary Let {v^i , v^ 2 , . . . , vV n } be an independent family of vectors with V = span (vi>i , v^ 2 , . . . , w n ) . 
If we also have V = span (hti ,ht 2 , . . . ,ht-v) , then 

1. n < v, 

2. n = v if and only if the \ lti,ht 2 ,...,h/-v} are a linearly independent family. 

3. Any basis for V has exactly n elements. 

Proof: 

1. In the Steinitz Replacement Theorem 221 replace the first n tt t ’s by the v^t’s and n < v 
follows. 

2. f/{iti,it 2j ...,hU are a linearly independent family, then we may interchange the role of 
the and iti obtaining v < n. Conversely, ifv = n and if the it t are dependent, we could 
express some it ; as a linear combination of the remaining v — 1 vectors, and thus we would 
have shewn that some v — 1 vectors span V. From (1) in this corollary we would conclude 
that n < v 1 , contradicting n = v. 

3. This follows from the definition of what a basis is and from (2) of this corollary. 
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□ 

223 Definition The dimension of a vector space (V, + , •, F) is the number of elements of any of its bases, 
and we denote it by dim V. 

224 Theorem Any linearly independent family of vectors 

fxi,x 2 , . . . ,lc k } 

in a vector space V can be completed into a family 

,S? 2 , • • ■ ,^k,^ k+1 ,^k+2> • • ■} 
so that this latter family become a basis for V. 

Proof: Take any basis {Tt i Tt k , Ti' k } and use Steinitz Replacement Theorem 221 . 

□ 

225 Corollary If U C V is a vector subspace of a finite dimensional vector space V then dim U < dim V. 

Proof: Since any basis of U can be extended to a basis ofV, it follows that the number of 

elements of the basis of U is at most as large as that for V. □ 

226 Example Find a basis and the dimension of the space generated by the set of symmetric matrices in 

Mnxn(R) • 


Solution: ► Let E t j 6 M nxn (M) be the n x n matrix with a 1 on the ij -th position and 0’s 

n(n — 1 ) 

everywhere else. For 1 < i < j < n, consider the ( 2 ) = matrices A,j = E t j + Ej t . The 

Atj have a 1 on the ij- th and ji -th position and 0’s everywhere else. They, together with the n 
matrices E u , 1 < i < n constitute a basis for the space of symmetric matrices. The dimension of 
this space is thus 

n(n — 1 ) n(n + 1 ) 

1- n = . 

2 2 

◄ 


227 Theorem Let {it i , . . . ,lt n } be vectors in lR n . Then the it’s form a basis if and only if the n x n matrix 
A formed by taking the it’s as the columns of A is invertible. 


Proof: Since we have the right number of vectors, it is enough to prove that the it ’s are linearly 

xi 


independent. But ifX = 


X 2 


then 


xi it] H hx n lt n = AX. 

If A is invertible, then AX = 0 n => X = A 1 0 n = 0 n , meaning that xi = X 2 = • • • x n = 0, so the 
u ’s are linearly independent. 


Conversely, assume that the it ’s are linearly independent. Then the equation AX = O n has 
a unique solution. Let r = rank (A) and let (P, Q) 6 (GL n (K)) 2 be matrices such that A = 
P 1 D n n r Q 1 , where D nnr is the Hermite normal form of A. Thus 


90 


Chapter 4 


AX = O n =* P- 1 D n , n , r Q- 1 X = O n =* D n n r Q 1 X = O n . 


Put Q 1 X = 


yi 


V2 


Vr 


. Then 


D n ,n, r Q X — 0,1 — / y^ e i + • • • + y r £ r - 0 n , 

where is the n-dimensional column vector with a 1 on the j-th slot and O’s everywhere else. 
If r < n then y r+ i , . . . , y n can be taken arbitrarily and so there would not be a unique solution, 
a contradiction. Hence r = n and A is invertible. □ 


Homework 

Problem 4.5.1 In problem 4.2.2 we saw that 


' 




a 



2a -3b 



5b 

: a, b G R 


a + 2b 



a 



is a vector subspace of R 5 . Find a basis for this subspace. 


Problem 4.5.2 Let ,^ 2,^3 ,^ 4 ,^ 5 } be a basis for a vector space V over a field F. Prove that 

{Vi +^2,^2 +^3,^3 + \?4,^4 +^5,^5 +^l} 


is also a basis for V. 


Problem 4.5.3 Find a basis for the solution space of the system of n + 1 linear equations of 2n unknowns 

Xi +X 2 H + x n = 0, 

X 2 +X3 H +Xn+1 = 0, 


x n + l +Xn +2 H + X 2 n = 0. 

Problem 4.5.4 Prove that the set V of skew-symmetric n x n matrices is a subspace of M n xn(R) and find its 
dimension. Exhibit a basis for V. 

Problem 4.5.5 Prove that the set 

X = {(a, b, c, d)|b + 2c = 0} C R 4 
is a vector subspace of R 4 . Find its dimension and a basis for X. 

Problem 4.5.6 Prove that the dimension of the vector subspace of lower triangular n x n matrices is ^ ^ and 

find a basis for this space. 
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Problem 4.5.7 Find a basis and the dimension of 



( 

1 


1 


2 

\ 

X = span 

vt = 

1 

, V2 = 

1 

, v2- 

2 




1 


1 


2 




1 


0 


1 

/ 

Problem 4.5.8 Find a basis and the dimension of 








/ 

1 


1 


2 

\ 

X = span 

vt = 

1 

, V2 = 

1 

, V3 = 

2 




1 


1 


2 




1 


0 


2 

/ 


Problem 4.5.9 Find a basis and the dimension of 


X = span 

( 

vT = 

i 

0 

, v2 = 

l 

0 

, V3 = 

0 

1 

, V4 = 

1 

-1 

\ 



0 

1 


2 

0 


2 

0 


0 

0 

/ 


Problem 4.5.10 Prove that the set 



f 


> 



a b c 


v= . 


0 d f 

E M 3 x 3 (I^)*CL + t)H-c = 0 ) a + d + g = 0 


< 

0 0 g 



is a vector space of M3 x 3 (R) and find a basis for it and its dimension. 


4.6 Coordinates 

228 Theorem Let • ■ ■ ,^n} be a basis for a vector space V. Then any 3? 6 V has a unique 

representation 

T? = a-iT^i + a 2 ^2 H 1- a n ^n- 


Proof: Let 

T? = + b 2 ^2 H + brCV^n 

be another representation of v' . Then 

= (ai - bi + (a 2 - b 2 )V 2 H b (an — b n )V n . 

Since f$ 2 , • • • , f n 1 forms a basis for V, they are a linearly independent family. Thus we 
must have 

ai — bi = a 2 — b 2 = • • • = a n — b n = 0 F) 

that is 

ai = bi; a 2 = b 2 ; - • • ; a n = b n , 

proving uniqueness. □ 
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229 Definition An ordered basis ,^ 2 , ■ • • ,^n} of a vector space V is a basis where the order of the 
has been fixed. Given an ordered basis ,^ 2 , • ■ • W n ) of a vector space V, Theorem 228 ensures that 
there are unique (a 1 , ai , . . . , a n ) £ F n such that 


v = dl v i 


02 v 2 


+ a„v, 


The a k ’s are called the coordinates of the vector v . 


-j — ) — ^ ^ 

230 Example The standard ordered basis for R 3 is 5 ? = { i , j , k}. The vector 


6 R 3 for example, has 


coordinates (1 , 2, 3)^. If the order of the basis were changed to the ordered basis = { i , k , ) }, then 

1 

2 € R 3 would have coordinates (1,3,2)^. 

3 


Usually, when we give a coordinate representation for a vector T? 6 R n , we assume that 
we are using the standard basis. 


231 Example Consider the vector 


6 R 3 (given in standard representation) . Since 


under the ordered basis 



1 


1 


1 


i 


2 

= -1 

0 

1 

1 

+ 3 

i 


3 


0 


0 


i 








1 

1 

1 


1 




= 




has coordinates (— 1 


— 1 , 3)^, . We write 


1 



2 

= 

-1 

3 


3 
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232 Example The vectors of 


#1 = 


are non-parallel, and so form a basis for M 2 . So do the vectors 


S&2 


find the coordinates of 


in the base 


Solution: ► We are seeking x, y such that 


= x 


1 1 


Thus 


1 

— 

1 


1 

2 


4 











X 


2 

1 


1 

1 


3 

y 

S&2 

1 

-1 


1 

2 


4 


2 1 
1 -1 


3&2 



1 1 


3 


1 2 


4 




6 

-5 


Lei us check by expressing both vectors in the standard basis of R 2 ; 


3 

= 3 

1 

+ 4 

1 


7 

4 


1 


2 


11 


◄ 









6 

= 6 

2 

-5 

1 

= 

7 

~5 

S&2 

1 


-1 


11 
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In general let us consider bases 38 1 , 38 z for the same vector space V. We want to convert Xag, to 
Y ag 2 . We let A be the matrix formed with the column vectors of 38-\ in the given order an B be the matrix 
formed with the column vectors of 382 in the given order. Both A and B are invertible matrices since the 
38 ' s are bases, in view of Theorem 227. Then we must have 


AXj, =BY* 2 =» Y* 2 =B- 1 AX^ 1 . 


Also, 

X», = A 1 BY^ 2 . 


This prompts the following definition. 

233 Definition Let 38-\ = (iti , 1 ^ 2 , • ■ • and S 82 = ,^ 2 , ■ ■ • »^n} be two ordered bases for a vector 

space V. Let A 6 M nxn (F) be the matrix having the it's as its columns and let B 6 M nxn (F) be the 
matrix having the ^’s as its columns. The matrix P = B 1 A is called the transition matrix from 38 1 to S 82 
and the matrix P 1 = A 1 B is called the transition matrix from 38z to 38-\ . 

234 Example Consider the bases of R 3 




1 


1 


2 

> 


1 

) 

-1 

> 

0 



-1 


0 


0 



Find the transition matrix from 38-\ to 38z and also the transition matrix from S 82 to S 8 -\ . Also find the 

1 


coordinates of 


in terms of 382 - 


1 

1 

1 


1 

1 

2 

1 

1 

0 

, B = 

1 

-1 

0 

1 

0 

0 


-1 

0 

0 


Solution: ► Let 
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The transition matrix from 88-\ to S &2 is 


P = B 1 A 





— 





1 

1 

2 


1 

1 

1 

1 

-1 

0 


1 

1 

0 

-1 

0 

0 


1 

0 

0 

0 

0 

-1 


1 

1 

1 


0 

-1 

-1 


1 

1 

0 


1 

2 

1 

2 

1 


1 

0 

0 



-TOO 


-2 -1 


-0 • 


21 1 

The transition matrix from d §2 to S3 1 is 


-1 

0 

0 

-1 


0 

0 

-2 

-1 

0 

= 

2 

-1 

0 

2 

1 

1 

2_ 


0 

2 

2 



-1 

0 

0 


1 


-1 

y .® 2 = 

-2 

-1 

0 


2 

= 

-4 


2 

1 

1 

2 _ 


3 

i 

1 1 

2 


As a check, observe that in the standard basis for M 3 


◄ 



-1 

= -i 

1 


1 

n 

2 


6 

-4 

1 

-4 

-1 

+ T 

0 

— 

3 

1 1 

2 


-1 


0 


0 


1 


S &2 
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Homework 


Problem 4.6.1 1. Prove that the following vectors are 

linearly Independent In R 4 



1 


1 


1 


1 

itl = 

1 

, ^2 = 

1 

, ^3 = 

-1 

II 

-1 


1 


-1 


1 


-1 


1 


-1 


-1 


1 


2. Find the coordinates of 


SiS fc^ , li 2 , ^3,^4). 


under the ordered ba- 


A(a) = 


Problem 4.6.2 Consider the matrix 

a 1 1 1 

0 10 1 
1 0 a 1 

1111 

O Determine all a for which A (a) is not invertible. 
© Find the inverse of A(a) when A(a) is invertible. 
© Find the transition matrix from the basis 


= 


1 


1 


1 


1 

1 


1 


1 


0 

1 


1 


0 


0 

1 


0 


0 


0 


to the basis 


3. Find the coordinates of 


sis ,^ 3 ,^2,~^4 


under the ordered ba- 


3B 2 = 


a 


1 


1 


1 

0 


1 


0 


1 

1 


0 


a 


1 

1 


1 


1 


1 
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Linear Transformations 


5.1 Linear Transformations 

235 Definition Let (V, +, •, F) and (W, +, •, F) be vector spaces over the same field F. A linear transforma- 
tion or homomorphism 

V -> W 
L: 

^ (-> L(ii) 

is a function which is 

• Linear: L(li + ~t>) = L(li) + L(t>), 

• Homogeneous: L(at£) = aLfS), for «gF. 


It is clear that the above two conditions can be summarised conveniently into 


L(li + ab) = L(l2) + aL(b). 


236 Example Let 


M nxn (R) — > R 

L : 

A i — > tr (A) 


Then L is linear, for if (A,B) € M nxn (R), then 

L(A + aB) = tr (A + aB) = tr (A) + atr (B) = L(A) + «L(B). 


237 Example Let 


M nxn (R) — > M nxn (R) 

L : 

A i-4 A t 

Then L is linear, for if (A,B) E M nxn (R), then 

L(A + aB) = (A + aB) T = A T + aB T = L(A) + aL(B). 
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238 Example For a point (x,y) & l 2 , its reflexion about the y-axis is (— x,y). Prove that 

R 2 -> R 2 

R : 

(x,y) (-4 (— x,y) 

is linear. 

Solution: ► Let (xi , yi ) 6 M 2 , (% 2 , V 2 ) € R 2 . and agl. Then 

R((*i,yi) + a(x 2 ,y 2 )) = R((xt + ax 2 ,yi + ay 2 )) 

= (— (XT + ax 2 ),yi + ay 2 ) 

= (— xi,yi) + a( — x 2 ,y 2 ) 


= R((xi,yi)) + aR((x 2 ,y 2 )), 


whence R is linear. ◄ 

239 Example Let L : R 2 — » R 4 be a linear transformation with 





-1 




2 

L 

1 


1 

; i- 



0 


1 


2 


i 


2 




3 




3 


Find L 


5 

3 


Solution: ► Since 


we have 


5 


1 


-1 


= 4 


— 


3 


1 


1 



5 


1 


-1 


= 4L 


- L 


3 


1 


1 


◄ 

240 Theorem Let (V, +, •, F) and (W, + , •, F) be vector spaces over the same field F, and let L : V W be 
a linear transformation. Then 
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• L( 0 v) = 0 w- 

• 6 V,L(-^) = — L("xb. 

Proof: We have 

hence 
Since 

we obtain the first result. 

Now 

= Lf3v) = L(^ + ( — xb) = L(xb + L( — xb, 
from where the second result follows. □ 


L(^ v ) = L("^v +^v) = L(^ v ) + L(^ v ), 


LfOv) -L(Ov) =L("0 V ) 
L(^v)-L(^v) =^w, 


Homework 


Problem 5.1.1 Consider L : R 3 — > IR 3 , 


X 


N 

y 

= 

x + y + z 

z 


z 


Prove that L is linear. 


Problem 5.1.2 Let A £ GL n (R) be a fixed matrix. Prove 


that 


Lxn (®0 

— > 

M n x n (®0 

H 


-A -1 HA -1 


is a linear transformation. 

Problem 5.1.3 Let V be a vector space and let S C V. 
The set S is said to be convex if Va £ [0;l],Vx,y £ S, 
(1 — a)x + ay £ S, that is, for any two points in S, 
the straight line joining them also belongs to S. Let 
T : V — > W be a linear transformation from the vector 
space V to the vector space W. Prove that T maps con- 
vex sets into convex sets. 


5.2 Kernel and Image of a Linear Transformation 

241 Definition Let (V, + , •, F) and (W, +, •, F) be vector spaces over the same field F, and 


V -> W 
T : 

T? i-> T(T?) 

be a linear transformation. The kernel of T is the set 

ker (T) = (V € V : Tftf) = ^ w }. 


The image of T is the set 

Im (T) = 6 w : 3T? £ V such that T(V) = = T(V). 


dr 


Since T(0y) = Ow by Theorem 240, we have 0 v € ker (T) and 0 w € Im (T) . 
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242 Theorem Let (V, +, •, F) and (W, +, •, F) be vector spaces over the same field F, and 

-4 W 
<-> Tfv) 

be a linear transformation. Then ker (T ) is a vector subspace of V and Im (T) is a vector subspace of 
W. 


T : 


V 


) 

V 


Proof: Let [A? 1,^2) G (ker(T)) 2 and a E F. Then T(T?i) = T(T?2) = Oy. We must prove that 

Vi + ocv 2 G ker(T), that is, that T(V-| +0^2) = Ow- Bat 

T(Vi + 0^2) = T ( V 1 ) + aT(T? 2 ) =^v + a^v = ^v 

proving that ker (T ) is a subspace ofV. 

Now, let (v^i,v^2) G (Im(T)) 2 and a E F. Then 3 (T?i,T? 2) G V 2 such that T(T?i) = and 
T f T? 2) = ft' 2 ■ We must prove that ft>i + avt» 2 G Im (T), that is, that 3 V such that T ( V*) = 
wi + aw2- But 

+ avo > 2 = T ( V 1 ) + aT(T? 2 ) = T(V?i + 0^2), 
and so we may take V = V 1 + on) 2 ■ This proves that Im ( T ) is a subspace of W. 

□ 

243 Theorem Let (V, + , •, F) and (W, + , •, F) be vector spaces over the same field F, and 


V -4 W 
T : 

"V (-4 T(T?) 


be a linear transformation. Then T is injective if and only if ker (T) = Oy. 


Proof: Assume that T is injective. Then there is a unique x E V mapping to 0 W : 

Tfxb = Ow- 

By Theorem 240 , T (~0 v ) = O'vv. i.e., a linear transformation takes the zero vector of one space 
to the zero vector of the target space, and so we must have V = 0 v- 

Conversely, assume that ker (T) = {C^v}. and that. T (3? ) = T(j|). We must prove that - f. But 


T(^)=T(^) =* 



TfV) — T(^) = 0 w 

- V) G ker (T) 
x-^=0v 

?=t, 


as we wanted to shew. □ 
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244 Theorem (Dimension Theorem) Let (V, +,-,F) and (W, +,-,F) be vector spaces of finite dimension 
over the same field F, and 

V -> W 
T : 

T? 1-4 T(T?) 

be a linear transformation. Then 

dimker (T) + dimlm (T) = dim V. 


Proof: Let fv^i,^ 2 ,...,^k } be a basis for ker (T). By virtue of Theorem 224, we may extend 
this to a basis = {Vi ,V 2) ■ ■ ■ ,V k ,Vi< + i ,^k+ 2 , ■ • ■ , v n ) ofV. Here n = dim V. We will now 
shew that SB = {T(V k+1 ), T(V k+2 ), ■ • • ,T(V n ) 1 is a basis for Im (T) . We prove that (i) SB spans 
Im (T), and (ii) SB is a linearly independent family. 

Let\ti 6 Im (T ) . Then 3 T 6 V such that T (V ) = vV. Now since /// is a basis for V we can write 

n 

V = Y 

i=l 

Hence 

n n 

= T(V) = Y_ a tT(^i) = Y a i T (V t ), 

i=1 i=k+1 

since T(Vt) =l$v for 1 < i < k. Thus SB spans Im (T). 

To prove the linear independence of the SB assume that 

n / n 

0w= Y. Pi^(^v) = Tl Y_ Pi^i 

t=k+1 \i=k+1 

This means that k 1 P fS i 6 ker (T), which is impossible unless p k+ i = Pk 1 2 = • • • = Pn = 

Of- 

□ 

245 Corollary If dim V = dim W < + 00 , then T is injective if and only if it is suijective. 

Proof: Simply observe that if T is injective then dimker (T) =0, and if T is surjective Im (T) = 

T(V) = W and Im (T) = dim W. □ 

246 Example Let 



m 2x2 (R) — » 

L : 

A 1-4 


M2x2(R) 

A t - A 


Observe that L is linear. Determine ker ( L) and Im (L) . 


Solution: ► Put A 


a b 


and assume L(A) = 0 2 . Then 


0 0 
0 0 


= L(A) 


a c 


a b 


0 1 


— 


II 

O 

1 

Cf 


b d 


c d 


1 

O 

1 


d 
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This means that c = b . Thus 


ker (L) 


a b 
b d 


: (a, b, d) 6 


Im (L) 


: k <E 


0 k 
-k 0 

This means that dim ker (L) =3, and so dim Im (L) = 4 — 3 = 1. ◄ 

247 Example Consider the linear transformation L : M2 X 2(R) — » M3IXI given by 


a b 
c d 


= (a + b)X 2 + (a — b)X 3 


Determine ker (L) and Im (L) 

Solution: ► We have 


0 = L 


a b 
c d 


= (a + b)X 2 + (a-b)X 3 


a + b = 0, a — b = 0, =4> a = b = 0. 


ker (L) 


: (c, d) 6 


Thus 

0 0 
c d 

Thus dim ker (L) = 2 and hence dimlm (L) = 2. Now 

(a + b)X 2 + (a-b)X 3 a(X 2 + X 3 ) + b(X 2 - X 3 ). 
Clearly X 2 + X 3 , and X 2 — X 3 are linearly independent and span Im (L). Thus 

Im (L) = span (X 2 + X 3 , X 2 - X 3 ) . 


Homework 


Problem 5.2.1 In problem 5.1.1 we saw that L : 

np3 


X 


x — y — z 

V 

= 

x + y + z 

z 


z 


is linear. Determine ker (L) and Im (L) 


Problem 5.2.2 Consider the function L : 
by 


x + y 

*-y 


given 
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1 . Prove that L is linear. 

2. Determine ker (L). 

3. Determine Im (L). 

Problem 5.2.3 Let 

R 3 -> R 4 

L : 

m Lf^) 

satisfy 


r 


1 




2 




1 

1 


0 


1 


-1 


0 



0 


-1 

; L 

1 


0 

; L 

0 


1 

0 




0 




1 





0 




0 




0 


Determine ker (L) and Im (L). 


Problem 5.2.6 It is easy to see that L : R 3 — > R 2 , 

x — y—z 
y — 2z 

is linear. Determine ker (L) and Im (L). 

Problem 5.2.7 Let 

1VI2 X 2 ( R ) — t R 

L : 

A i-> tr(A) 
Determine ker (L) and Im (L). 

Problem 5.2.8 1. Demonstrate that 


x 



z 


Problem 5.2.4 It is easy to see that L : R 2 — > R 3 , 


L 


x 


x + 2y 
x + 2y 
0 


is linear. Determine ker (L) and Im (L). 


Problem 5.2.5 It is easy to see that L : R 2 -> R 3 , 


L 


x 


y 


*-y 

X + y 
0 


M2 x 2 (R) — > M2 x 2 [R) 

L : 

A A T + A 

is a linear transformation. 

2. Find a basis for ker (L) and find dim ker (L) 

3. Find a basis for Im (L) and find dimlm (L). 


Problem 5.2.9 Let V be an n-dimensional vector space, 
where the characteristic of the underlying field is differ- 
ent from 2. A linear transformation T : V — > V is idempo- 
tent if T 2 = T. Prove that if T is idempotent, then 

O I — T is idempotent (I is the identity function). 

© I +T is invertible. 


is linear. Determine ker (T) and Im (T). 


© ker (T) = Im (I - T) 
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5.3 Matrix Representation 

Let V, W be two vector spaces over the same field F. Assume that dim V = m and {A? t}t e n ;m ] is an ordered 
basis for V, and that dim W = n and sd = {v^i} ie [i ;n ] an ordered basis for W. Then 


Lfv/i 


+ 0 . 21^2 + 


+ ClnlV^n 


dll 

d-21 


Lfv 2 ] 


ai 2 wi 


0-22^2 


Un2W„ 


d-nl 

ai2 

d-22 


d-n2 


L(^m) = aimWi + a 2 m w 2 


CX 7xr n W n 


n 

&2n 


248 Definition The n x m matrix 



di i 

a 12 • 

* 0-1 n 


d-21 

a 12 • 

* GfZri 

M l = 

d-nl 

Cln2 • 

* CLft m 


formed by the column vectors above is called the matrix representation of the linear map L with respect 
to the bases t } ie[1 ;m] , ;n] • 

249 Example Consider L : R 3 — > R 3 , 


■ ■ 


■ 

X 


x — y — z 

y 

= 

x + y + z 

z 


z 


Clearly L is a linear transformation. 

1. Find the matrix corresponding to L under the standard ordered basis. 
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1 


1 


1 

2. Find the matrix corresponding to L under the ordered basis 

0 

> 

1 

) 

0 


0 


0 


1 


and the image of L. 


for both the domain 


Solution: ► 



1 


1 


0 


-1 


0 


-1 

1 . The matrix will be a 3x3 matrix. We have L 

0 

= 

1 

, L 

1 

= 

1 

, and L 

0 

= 

1 


0 


0 


0 


0 


1 


1 


whence the desired matrix is 


1 -1 -1 
1 1 1 

0 0 1 

2. Call this basis ■<// . We have 



1 


1 


1 


1 


1 


0 

L 

0 

= 

1 

= 0 

0 

+ 1 

1 

+ 0 

0 

= 

1 


0 


0 


0 


0 


1 


0 



1 


0 


1 


1 


1 


-2 

L 

1 

= 

2 

= -2 

0 

+ 2 

1 

+ 0 

0 

= 

2 


0 


0 


0 


0 


1 


0 


and 


1 


0 


1 


1 


1 


-3 

0 

= 

2 

= -3 

0 

+ 2 

1 

+ 1 

0 

= 

2 

1 


1 


0 


0 


1 


1 


whence the desired matrix is 


0 


-2 


-3 


1 2 2 


0 0 1 


◄ 


250 Example Let M n [x] denote the set of polynomials with real coefficients with degree at most n. 
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1. Prove that 


K3M — » Ki[x] 

L : 

p(x) i-> p"(x) 


is a linear transformation. Here p"(x) denotes the second derivative of p(x) with respect to x. 


2. Find the matrix of L using the ordered bases (1 , x, x 2 , x 3 } for R3M and {1 , x} for K2M. 

3. Find the matrix of L using the ordered bases (1 , x, x 2 , x 3 } for K3 [x] and {1 , x + 2} for I3 [x]. 

4. Find a basis for ker (L) and find dimker (L). 

5. Find a basis for Im (L) and find dimlm (L). 

Solution: ► 


1. Let (p(x), q(x)) 6 (R3M) 2 and agl. Then 


L(p(x) + aq(x)) = (p(x) + aqfx))" = p"(x) + aq"(x) = L(p(x)) + aL(q(x)), 


whence L is linear. 
2. We have 


L(1) 


L(x) 


L(x 2 ) 


L(x 3 ) 






0 = 0(1 ) + 0(x) 


0 = 0(1 ) + 0(x) 


2 = 2(1 ) + 0(x) 


6x = 0(1 ) + 6(x) 


0 

0 

0 

0 

2 

0 

0 

6 


whence the matrix representation of L under the standard basis is 


0 0 2 0 


0 0 0 6 
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3. We have 

W) 

L(x) 

L(x 2 ) 

L(x 3 ) 






0 = 0(1 ) + 0(x + 2) 


0 = 0(1 ) + 0(x + 2) 


2 = 2(1 ) + 0(x + 2) 


6x = -12(1) +6(x + 2) 


0 

0 

0 

0 

2 

0 


-12 

6 


whence the matrix representation of L under the standard basis is 


0 0 2 -12 

0 0 0 6 


4. Assume thatp[x) = a + bx + cx 2 + dx 3 E ker (L). Then 

0 = L(p(x)) = 2c + 6dx, Vx E M. 

This means that c = d = 0. Thus a, b are free and 

ker (L) = {a + bx : (a, b) E R 2 }. 

Hence dim ker (L) = 2. 

5. By the Dimension Theorem, dimlm (L) =4 — 2 = 2. Put q(x) = a + |3x + yx 2 + 6x 3 . Then 

L(q(x)) = 2y + 6S(x) = (2y)(1) + (66)(x). 

Clearly {1 , x} are linearly independent and span Im (L) . Hence 

Im (L) = span (1 ,x) = Ri[x], 


251 Example 

1 . A linear transformation T : R 3 — > R 3 is such that 


2 


3 


— ) 


1 

; T( j ) = 

0 

1 


-1 


It is known that 

Im (T) = span (J( i),T( j )) 
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and that 



( 

1 

\ 

ker (T) = span 


2 



\ 

-1 

/ 


Argue that there must be A and p such that 

T(k) =AT(“l > ) + pT(t). 


2. Find A and p, and hence, the matrix representing T under the standard ordered basis. 


Solution: ► 

1 . Since T(lc ) E Im (T) and Im (T) is generated by T ("i ) and T ( 1c ) there must be (A, p) 6 R 2 
with 



2 


3 


2A + 3p 

V V V 






T(k)=AT(i) + pT(j)=A 

1 

+ h 

0 

— 

A 


1 


-1 


A - p 


2. The matrix ofTis 


T(t) 


T(t) T(t) 


2 3 2A + 3p 

1 0 A 


1-1 A-p 


Since 


1 

2 


6 ker (T) we must have 


2 3 2A + 3p 


1 


0 

1 0 A 


2 

= 

0 

1-1 A-p 


-1 


0 


Solving the resulting system of linear equations we obtain A = 1 , p = 2. The required matrix 
is thus 

2 3 8 

10 1 
1 -1 -1 

◄ 

If the linear mapping L : V — » W, dimV = n.dimW = m has matrix representation 
A 6 M mxn (F), then dimlm (L) = rank (A). 
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Homework 


Problem 5.3.1 Let T : 


be a linear transformations such that 



1 




1 




1 




-1 






0 




1 




0 




1 


1 




0 




1 




-2 



T 

1 


0 

T 

1 


1 

T 

1 


0 

T 

0 


1 




1 




-1 




-1 




1 


1 




0 




0 




0 




Find the matrix of T with respect to the canonical bases. Find the dimensions and describe ker (T) and Im (T). 


Problem 5.3.2 1. A linear transformation T : R 3 — > R 3 has as image the plane with equation x + y + z = 0 and 

as kernel the line x = y = z. If 



1 


a 


2 


3 


1 


-1 

T 

1 

= 

0 

, T 

1 

= 

b 

, T 

2 

= 

2 


2 


1 


1 


-5 


1 


c 


Find a, b, c. 

2. Find the matrix representation of T under the standard basis. 


Problem 5.3.3 1. Prove that T : R 2 — > R 3 


T 


x 


y 


is a linear transformation. 

2. Find a basis for ker (T) and find dim ker (T) 

3. Find a basis for Im (T) and find dimlm (T). 


x + y 

x-y 

2x + 3y 


4. Find the matrix of T under the ordered bases ,<// = 


of R 2 and S§ = < 



1 


1 


0 



1 

> 

0 

> 

1 



1 


-i 


0 



> of I 


Problem 5.3.4 Let 

where 


x + 2y 
3x — z 



R 3 -> R 2 

M Lfci) 


Clearly L is linear. Find a matrix representation for L if 
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1. The bases for both R 3 and R 2 are both the standard ordered bases. 



1 


1 


1 



2. The ordered basis for R 3 is 

0 

) 

1 

> 

1 

and R 2 has the standard ordered basis . 


0 


0 


1 




1 


1 


1 


f 


3. The ordered basis for R 3 is 

0 

) 

1 

) 

1 

and the ordered basis for R 2 is = < 


1 









0 


0 


0 


1 





Problem 5.3.5 A linear transformation T : R 2 — > R 2 satisfies ker (T) 


Im (T), and T 


1 

1 


representing T under the standard ordered basis. 


Problem 5.3.6 Find the matrix representation for the linear map 


M 2 x 2 (R) — i R 

A h-> tr(A) 


under the standard basis 


for M 2X 2 (R). 



1 

0 


0 1 


0 

0 


1 

0 

0 

1 


0 

0 


0 0 


1 

0 


0 1 

1 



2 

. Find the matrix 
3 


Problem 5.3.7 Let A £ M nxp (R), B £ M px q (R), and C £ M q xr (R), be such that rank ( B ) =rank(AB ). Shew that 

rank (BC) = rank (ABC) . 
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Determinants 


6.1 Permutations 

252 Definition Let S be a finite set with n > 1 elements. A permutation is a bijective function t : S — » S. 
It is easy to see that there are n! permutations from S onto itself. 


Since we are mostly concerned with the action that r exerts on S rather than with the particular names 
of the elements of S, we will take S to be the set S = (1 , 2, 3, . . . , n}. We indicate a permutation t by 
means of the following convenient diagram 


1 2 ••• n 

t(1) t(2) ••• T(n) 


253 Definition The notation S n will denote the set of all permutations on {1 ,2, 3, . . . ,n}. Under this 
notation, the composition of two permutations (t, <t) E is 



1 2 

n 


1 

2 

n 

TO (J = 

T(l) t(2) 

••• t(u) 

0 

ad) 

a(2) 

••• a(n) 


1 

2 



n 



(to a)(1) 

(to cr ) (2) 


(t 0 

a)(n) 



The k-fold composition of t is 


t o • • • o t = T 

k compositions 


We usually do away with the o and write to a simply as to. This “product of permutations” 
is thus simply function composition. 

Given a permutation r : S -4 S, since t is bijective, 

t ^ : S — > S 
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exists and is also a permutation. In fact if 

1 2 ••• n 

T = 

t( 1 ) t(2) ••• x(n) 

then 



Figure 6.1: S 3 are rotations and reflexions. 


254 Example The set S3 has 3! = 6 elements, which are given below. 
1. Id : {1 , 2, 3} — > {1 , 2, 3} with 


2 . Ti : { 1 , 2 , 3} — > { 1 , 2 , 3} with 


3. t 2 : {1 , 2, 3} — > {1 , 2, 3} with 


4. t 3 : {1 , 2, 3} -> {1 , 2, 3} with 


5. cti : {1 , 2, 3} — » {1 , 2, 3} with 


Id 


Tt = 


t 2 = 


T3 = 


CTl = 


1 2 3 

1 2 3 

1 2 3 

1 3 2 

1 2 3 

3 2 1 

1 2 3 

2 1 3 

1 2 3 

2 3 1 
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6. 02 : {1 , 2, 3} — > , 2, 3} with 


02 


1 2 3 

3 1 2 


255 Example The compositions ti o cn and cti o ti can be found as follows. 



1 2 3 


1 2 3 


1 2 3 

T) O ffi = 


o 


= 



1 3 2 


2 3 1 


3 2 1 

(We read from right to left 1 — > 2 — » 3 

(“1 goes to 2, 2 goes to 3, 

so 1 goe 


1 2 3 


1 2 3 


1 2 3 

01 O T] = 


o 


= 



2 3 1 


1 3 2 


2 1 3 


= t 2 . 


= t 3 . 


Observe in particular that ci o ti ^ ti o <ti. Finding all the other products we deduce the following 
“multiplication table” (where the “multiplication” operation is really composition of functions) . 


O 

Id 

Ti 

t 2 

T3 

Cl 

02 

Id 

Id 

Ti 

t 2 

T3 

Cl 

02 

Ti 

Ti 

Id 

Cl 

02 

t 2 

T3 

T2 

t 2 

02 

Id 

Cl 

T3 

Ti 

T3 

T3 

o-i 

02 

Id 

Ti 

t 2 

0-2 

C2 

t 2 

T3 

Ti 

Id 

Cl 

Cl 

Cl 

T3 

Ti 

t 2 

02 

Id 


The permutations in example 254 can be conveniently interpreted as follows. Consider an equilateral 
triangle with vertices labelled 1, 2 and 3, as in figure 6.1. Each r a is a reflexion (“flipping”) about 
the line joining the vertex a with the midpoint of the side opposite a. For example ti fixes 1 and 
flips 2 and 3. Observe that two successive flips return the vertices to their original position and so 
(Va € {1 , 2, 3}) (tJ = Id ). Similarly, is a rotation of the vertices by an angle of 120°. Three successive 
rotations restore the vertices to their original position and so = Id . 


256 Example To find t 1 1 take the representation of r^ and exchange the rows: 

1 3 2 

1 2 3 

This is more naturally written as 
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Observe that t-, 1 = Ti . 

257 Example To find cr 1 1 take the representation of cri and exchange the rows: 

2 3 1 

1 2 3 

This is more naturally written as 

1 2 3 

3 1 2 

Observe that o^ 1 = 02- 

6.2 Cycle Notation 

We now present a shorthand notation for permutations by introducing the idea of a cycle. Consider in 
S? the permutation 

1 23456789 

T = 

213697845 

We start with 1. Since 1 goes to 2 and 2 goes back to 1, we write (12). Now we continue with 3. Since 3 
goes to 3, we write (3). We continue with 4. As 4 goes 6, 6 goes to 7, 7 goes 8, and 8 goes back to 4, we 
write (4678). We consider now 5 which goes to 9 and 9 goes back to 5, so we write (59). We have written 
t as a product of disjoint cycles 

x= (12) (3) (4678) (59) . 

This prompts the following definition. 

258 Definition Let l > 1 and let ii , . . . , ii E {1,2,... nj be distinct. We write (ii V2 ... L ) for the element 

o' 6 S n such that cr(i r ) = i r +i , 1 < r < l, cr(ii) = ii and c(i) = i for i §? (ii , . . . , ii). We say that 

(ii 12 ... ii) is a cycle of length l. The order of a cycle is its length. Observe that if t has order l then 
t 1 = Id . 




Observe that (h ■ ■ • ii ii ) = (ii • • ■ ii) etc., and that (1 ) 

have 

(ii • • • h) = (ii • • ■ jm) 


( 2 ) 


(n) = Id . In fact, we 


if and only if (1) l = m and if (2) l > 1 : 3a such that Vk: i k = j k , u mod i- Two cycles (ii , . . . , ii) 
and (ji , . . . , j m ) cure disjoint if (ii , . . . , ii} n {ji , ... , j m ) = 0. Disjoint cycles commute and if 
r = ci 02 • • • 04 is the product of disjoint cycles of length li , I 2 > - - - > It respectively, then t has 
order 

lcm (li , l 2) . . . , l t ) . 


259 Example A cycle decomposition for a 6 S9, 


1 


a = 


1 


23456789 

87623459 


is 

(285) (3746). 

The order of a is lcm (3, 4) =12. 
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260 Example The cycle decomposition p = (123) (567) in S9 arises from the permutation 


P 


123456789 

231467589 


Its order is 1cm (3, 3) = 3. 

261 Example Find a shuffle of a deck of 13 cards that requires 42 repeats to return the cards to their 
original order. 

Solution: ► Here is one (of many possible ones). Observe that 7 + 6 = 13 and 7 x 6 = 42. We 
take the permutation 

(1 2 3 4 5 6 7)(8 9 10 11 12 13) 

which has order 42. This corresponds to the following shuffle: For 

i G {1 , 2, 3,4, 5, 6, 8, 9, 10, 1 1 , 12}, 

take the ith card to the (i + 1 )th place, take the 7 th card to the first position and the 13th card 
to the 8th position. Query: Of all possible shuffles of 13 cards, which one takes the longest to 
restitute the cards to their original position? ◄ 

262 Example Let a shuffle of a deck of 10 cards be made as follows: The top card is put at the bottom, 
the deck is cut in half, the bottom half is placed on top of the top half, and then the resulting bottom 
card is put on top. How many times must this shuffle be repeated to get the cards in the initial order? 
Explain. 


Solution: ► Putting the top card at the bottom corresponds to 

12345678 9 10 

23456789 10 1 

Cutting this new arrangement in half and putting the lower half on top corresponds to 

123 4 56789 10 

789 10 12345 6 

Putting the bottom card of this new arrangement on top corresponds to 


1234 5 6789 10 

6789 10 1234 5 


(1 6) (2 7) (3 8) (4 9) (5 10). 


The order of this permutation is 1cm (2, 2, 2, 2, 2) = 2, so in 2 shuffles the cards are restored to 
their original position. ◄ 

The above examples illustrate the general case, given in the following theorem. 

263 Theorem Every permutation in S n can be written as a product of disjoint cycles. 
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Proof: Let t 6 S n , ai G {1 , 2, . . . , n}. Put x k ( ai ) = a.k+ 1 , k > 0. Let ai , az , . . . , a s he the longest 
chain with no repeats. Then we have r( a s ) = ai . If the {ai , a 2 , . . . , a s } exhaust {1 , 2, . . . , n} 
then we have r = (ai a 2 . . . a s ). If not, there exist bi G {1,2, . . . , n} \ (ai , d 2 , . . . , a s ). Again, 
we find the longest chain of distinct bi , t> 2 , . . . , b t such that T(b k ) = b k+ i ,k = 1 , . . . , t — 1 
and x(b t ) = bi .If the {ai , a 2 , ...» a s , bi , b 2 , ... , b t } exhaust all the {1 , 2, . . . , n} we have t = 

(ai a 2 ... a s )(bi b 2 ... b t ). If not we continue the process and find 

t = (ai a 2 ... a s )(bi b 2 ... b t )(ci . . .) 

This process stops because we have onlg n elements. □ 

264 Definition A transposition is a cycle of length 2. 1 

265 Example The cycle (23468) can be written as a product of transpositions as follows 

(23468) = (28) (26) (24) (23). 

Notice that this decomposition as the product of transpositions is not unique. Another decomposition is 

(23468) = (23) (34) (46) (68). 

266 Lemma Every permutation is the product of transpositions. 

Proof: It is enough to observe that 

(ai a 2 ... a s ) = (ai a s )(ai a s -i ) • • • (ai a 2 ) 
and appeal to Theorem 263. □ 

Let cr G S n and let (i , )) G {1 , 2, . . . , n} 2 , i f ). Since a is a permutation, 3 (a, b) G {1,2,..., n} 2 , a =fb, 

such that cr(j) — a(i) = b — a. This means that 


n 

■ I <i<j <n 


q(t) - tr(j) 

i -) 


= 1. 


267 Definition Let a G S n . We define the sign sgn(cr) of a as 


If sgn(ff) = 1, then we say that a is an even permutation, and if sgn(c) = — 1 we say that a is an odd 
permutation. 

Notice that in fact 

sgn(o-) = (-1) I(<T \ 

where I(cr) = #{(i, j) 1 1 < i < j < n and cr(i) > a(j)}, i.e., I(ct) is the number of inversions that a 
effects to the identity permutation Id . 

268 Example The transposition (1 2) has one inversion. 

269 Lemma For any transposition (k l) we have sgn( (k l) ) = — 1 . 


sgn(ff) = ]^[ 

1 <i<j <n 


g(i) - o~(j ) 
i- j 


= (- 1 ) 


1 A cycle of length 2 should more appropriately be called a bicycle. 
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Proof: Let x be transposition that exchanges k and l, and assume that k < l: 


1 

2 .. 

. k — 1 

k 

k + 1 .. 

.. 1-1 

l 

l + l .. 

n 

1 

2 .. 

. k- 1 

l 

k + 1 .. 

. . 1-1 

k 

l + l .. 

n 


Let us count the number of inversions of x: 

• The pairs (i, )) withi £ {1 , 2, . . . , k — 1}U {1,1 + 1 , . . . ,n} and i < ) do not suffer an inversion; 

• The pair (k, j) with k < j suffers an inversion if and only if j £ {k + 1 , k + 2, . . . , l}, making 
l — k inversions; 

• (fi£{k + 1,k + 2, — 1} and i < j, (i, j ) suffers an inversion if and only if j = l, giving 

l — 1 — k inversions. 

This gives a total of I(t) = (l — k) + (l — 1 — k) = 2(1 — k — 1 ) + 1 inversions when k < l. Since 
this number is odd, we have sgn(x) = (—1 ) I!r ' = — 1 . In general we see that the transposition 
(k l) has 2|k — l| — 1 inversions. □ 

270 Theorem Let (tr, t) £ S^. Then 

sgn(xcr) = sgn(T)sgn(cr). 


Proof: We have 


sgn(CTx) 


XT ( cn:) (i) — (o-r)(j) 

1 ll <i<j<n 


i— J 


( rx olT(i)) — <t(t(j)) \ _ ( r-r t(v)— t(j) \ 

^ill<i<j<n x(i) — t( j ) J ' {ill<t<j<n i— j J 


The second factor on this last equality is clearly sgn(x), we must shew that the first factor is 
sgn(cr). Observe now that for 1 < a < b < n we have 

tx(a) — cr(b) o-(b) — cr(a) 
a — b b — a 


Since o andr are permutations, 3b =f a, x(i) = a, x(j) = b and so o-x(i) = tr(a), crx( j ) = b. Thus 


and so 


□ 


°~(x(i)) - cr(T(j)) _ tr(a) - a(b) 
x(i) — x(j) a — b 


n 

1 <i<j <n 


o~(x(i)) 

x(i) 


o~(x(j)) 

T(j) 


n 

1 <a<b<n 


o~(a) 

a 


o~(h) 

b 


sgn(o-). 


271 Corollary The identity permutation is even. If x £ S n , then sgn(x) = sgn(x 1 ). 

Proof: Since there are no inversions in Id , we have sgn(Id ) = ( — T ) 0 = 1 . Since xx 1 = Id , 

we must have 1 = sgn ld ) = sgnfxx -1 ) = sgn(x)sgn(x _1 ) = (—1 ) T (— 1 ) T 1 by Theorem 270. 
Since the values on the righthand of this last equality are ±1 , we must have sgn(x) = sgn ( x 1 ). 

□ 


272 Lemma We have sgn(1 2 ... 1)) = (—1 ) l 1 . 


Proof: Simply observe that the number of inversions of [I 2 ... 1) is l— !.□ 
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273 Lemma Let (t, (ii ... t v ) e Sjy Then 

T(ii . . . il)T _1 = (x(li ) . . . T(iv)), 
and if ct 6 S n is a cycle of length l then 

sgn(cr) = (-I) 1-1 


Proof: For 1 < k < l we have (x(ii ... i L )x MMik)) = x((ii ... h)(i k )) = x(ik+i). 

On a {r{U ... ii)T _1 ) (x(u)) = x((ii ... ii) (ii) ) = x(ii). For i £ {x(ii ) ... x(i x )} we have 
x~ 7 (i) ^ {ii ... ii) whence (ii ... ii)(x _1 (i)) = x~ 7 (i) etc. 

Furthermore, write a = (ii . . . ii). Let r 6 S n be such that x(k) = i k for 1 < k < l. Then 
a = x(l 2 . . . l)x _1 and so we must have sgn(cr) = sgn(x)sgn((l 2 . . . l))sgn(x _1 ), which 
equals sgn((1 2 ... 1)) by virtue of Theorem 270 and Corollary 271. The result now follows by 
appealing to Lemma 272 □ 

274 Corollary Let a = cti a 2 • • ■ oy he a product of disjoint cycles, each of length li , . . . , l r , respectively. 
Then 

sgn(cr) = (— 1 

Hence, the product of two even permutations is even, the product of two odd permutations is even, and 
the product of an even permutation and an odd permutation is odd. 

Proof: This follows at once from Theorem 270 and Lemma 273. □ 

275 Example The cycle (4678) is an odd cycle; the cycle (1 ) is an even cycle; the cycle (12345) is an even 
cycle. 

276 Corollary Every permutation can be decomposed as a product of transpositions. This decomposition 
is not necessarily unique, but its parity is unique. 


Proof: This follows from Theorem 263, Lemma 266, and Corollary 274. □ 


277 Example (The 15 puzzle) Consider a grid with 16 squares, as shewn in (6.1), where 15 squares are 
numbered 1 through 15 and the 16th slot is empty. 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 



( 6 . 1 ) 


In this grid we may successively exchange the empty slot with any of its neighbours, as for example 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 


15 


( 6 . 2 ) 
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We ask whether through a series of valid moves we may arrive at the following position. 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

15 

14 



(6.3) 


Solution: ► Let us shew that this is impossible. Each time we move a square to the empty 
position, we make transpositions on the set {1 , 2, . . . , 1 6}. Thus at each move, the permutation 
is multiplied by a transposition and hence it changes sign. Observe that the permutation cor- 
responding to the square in (6.3) is (14 15) (the positions 14th and 15th are transposed) and 
hence it is an odd permutation. But we claim that the empty slot can only return to its original 
position after an even permutation. To see this paint the grid as a checkerboard: 


B 

R 

B 

R 

R 

B 

R 

B 

B 

R 

B 

R 

R 

B 

R 

B 


(6.4) 


We see that after each move, the empty square changes from black to red, and thus after an 
odd number of moves the empty slot is on a red square. Thus the empty slot cannot return to its 
original position in an odd number of moves. This completes the proof. ◄ 


Homework 


Problem 6.2.1 Decompose the permutation 

1 23456789 

234158679 
as a product of disjoint cycles and find its order. 


6.3 Determinants 

There are many ways of developing the theory of determinants. We will choose a way that will allow 
us to deduce the properties of determinants with ease, but has the drawback of being computationally 
cumbersome. In the next section we will shew that our way of defining determinants is equivalent to a 
more computationally friendly one. 

It may be pertinent here to quickly review some properties of permutations. Recall that if a £ S n is a 
cycle of length l, then its signum sgn( cr) = ±1 depending on the parity of l — 1 . For example, in S7, 

<r= (1 34 76) 

has length 5, and the parity of 5 — 1 = 4 is even, and so we write sgn( ct) = +1 . On the other hand, 


t=(1 34765) 
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has length 6, and the parity of 6 — 1 = 5 is odd, and so we write sgn(x) = — 1 . 

Recall also that if (o', x) E S(p then 

sgn(xcr) = sgn(T)sgn(o-). 

Thus from the above two examples 

o-T = (1 3 4 7 6X1 3 4 7 6 5) 

has slgnum sgn(o-)sgn(x) = ( + 1 ) ( — 1 ) = —1. Observe in particular that for the identity permutation 
Id 6 Sn we have sgn(Id ) = +1 . 

278 Definition Let A E M nxn (F), A = [ap] be a square matrix. The determinant of A is defined and 
denoted by the sum 

det A = ^ Sgn( O') cl-] o-( i ) Cl2cjf2) ' * * G-ncr(n) • 

crGSn 


The determinantal sum has n! summands. 

279 Example If n = 1, then Si has only one member, Id , where Id (1) = 1. Since Id is an even 
permutation, sgn(Id ) = (+1 ) Thus if A = (an ), then 

det A = an 


280 Example If n = 2, then S 2 has 2! = 2 members, Id and o = (1 2). Observe that sgn( o) = — 1 . Thus if 


an 

ai2 

Q21 

022 


A = 

then 

det A = sgn(Id )ai Id ( i)a 2 id ( 2 ) + sgn(o')ai cr( i ) a 2 a( 2 ) = an a 2 2 - ai 2 a 2 i. 
281 Example From the above formula for 2x2 matrices it follows that 

1 2 
3 4 

= (1 )(4) — (3) (2) = —2, 


and 


det A = det 


det B = det 


= -10, 


-1 2 
3 4 


(—1 ) (4) — (3) (2) 


det(A + B) = det 


0 4 
6 8 


= (0) (8) - (6) (4) =-24. 


Observe in particular that det (A + B) f det A + det B. 
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282 Example If n = 3, then S 2 has 3! = 6 members: 

Id ,ti = (2 3), x 2 = (1 3), t 3 = (1 2), cn =(1 2 3 ),o- 2 = (1 3 2). 
. Observe that Id , cri , 02 are even, and xi , x 2 , x 3 are odd. Thus if 



Cll 1 

Cll 2 

Cll 3 

Cl21 

Cl22 

Cl23 

Cl 3 1 

Cl 3 2 

Cl 3 3 


then 

det A = sgn ld )a 1Id ( i )a 2 id (2)Q3id (3) + sgnfxi )ai Tl (i )Ci 2 Xl ( 2 ) cm, (3) 

+Sgn(T2)ai X2 (1)02X2(2)0-3X2(3) + Sgn(x 3 ) ai X3 (1 ) Cl 2 X3 ( 2 ) Cl3x 3 (3) 

+sgn(ai Jaio-, (i 1020 -! ( 2 )^ 50 -, ( 3 ) + sgn[o2)ciiCT 2 (i)Ci2<T 2 (2)Ci 3(T2 ( 3 ) 
= Clll CL22O.33 — Cll 1 Q23&32 — Ch3<l22Cl31 

— cm cm a 33 + cmcmcm + cm cm cm- 


283 Theorem (Row-Alternancy of Determinants) Let A e M nxn (F), A = [a tj ], If B e M nxn (F), B = [by] is 
the matrix obtained by interchanging the s-th row of A with its t-th row, then det B = — det A. 


Proof: Let x be the transposition 


r = 


s t 
xft) xfs) 


Then crx(a) = o' (a) for a 6 (1 , 2, . . . , n} \ {s, t}. Also, sgn(crx) = sgn(cr)sgn(x) = — sgn( cr). As o' 
ranges through all permutations of S n , so does crx, hence 


det B = ^<x£S n cr)bi cr(i )h 2 c( 2 ) * * * b S{T ( S ) • • • b-to-ft) * * * b no -( n j 

= X!cr£S n CC) Cll cr ( 1 ) Cl2<x(2) * * * Q s t • • • Cits • • • Ctna-fn) 

~ crx) 0-1 (TT( 1 ) C12(tt(2) * * * CCsctt(s) * * * Ch(j X (t) * * * CCncT(n) 

= ~ ^A6Sn S £ n M a iA(i )02A(2) ' • * d n A( n ) 

= det A. 


□ 
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284 Corollary If A( r:k ) ,1 < k < n denote the rows of A and a 6 S n , then 


det 


A( r:<T (i ) 

A( r;0 -(2) 

A( r;0 -fn) 


= (sgn(o-)) det A. 


An analogous result holds for columns. 

Proof: Apply the result of Theorem 283 multiple times. □ 

285 Theorem Let A <E M nxn (F), A = [a tj ]. Then 

det A t = det A. 


Proof: Let C = A T . By definition 

det A t = det C 

= O') C 1 cr(1 ) C 2c (2 ) * ' ’ C no -(n) 

= X-crGSri o) Q-o- [1 ) 1 Cl 0 -(2)2 * * * 077(77)77 . 

But the product a, T( i ) i a (T r 2 1 2 • • • o, T ( n!n also appears in det A with the same signum sgn(cr), 
since the permutation 

ct(1) a {2) ■■■ cr(n) 

1 2 ••• n 

is the inverse of the permutation 

1 2 ••• n 

o(l) o(2) ••• cr(n) 

□ 

286 Corollary (Column-Alternancy of Determinants) Let A e M nxn (F), A = [a^]. If C e M nxn (F), C = [c tj ] 
is the matrix obtained by interchanging the s-th column of A with its t-th column, then det C = — det A. 

Proof: This follows upon combining Theorem 283 and Theorem 285. □ 

287 Theorem (Row Homogeneity of Determinants) Let A e M nxn (F), A = [a tj ] and a e F. IfB 6 M nxn (F), B 
[bij] is the matrix obtained by multiplying the s-th row of A by a, then 

det B = a det A. 


Proof: Simply observe that 

Sgll( O') ai o- : ] Q-2 ct(2 ) * * * CCCL so -( s ) • • • 0.77 n) = £XSgn( O') (X-j (j(l) 0-2 (7 (2) * * " Ct so -( s ) • • • 0-77(7(77) . 


□ 
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288 Corollary (Column Homogeneity of Determinants) If C e M nxn (F), C = (Cp) is the matrix obtained 
by multiplying the s-th column of A by a, then 


det C = a det A. 


Proof: This follows upon using Theorem 285 and Theorem 287. □ 


It follows from Theorem 287 and Corollary 288 that if a row (or column) of a matrix consists 
ofOyS only, then the determinant of this matrix is Op. 


289 Example 



X 

1 

a 


1 

1 

a 

det 

X 2 

1 

b 

= x det 

X 

1 

b 


X 3 

1 

c 


X 2 

1 

c 


290 Corollary 

det(ccA) = a n det A. 

Proof: Since there are n columns, we are able to pull out one factor of a from each one. □ 

291 Example Recall that a matrix A is skew-symmetric if A = — A T . Let A 6 M2001 (R) be skew- 
symmetric. Prove that det A = 0. 

Solution: ► We have 

det A = det(— A t ) = (-1 ) 2001 detA T = detA, 
and so 2 det A = 0,from where det A = 0. ◄ 

292 Lemma (Row-Linearity and Column-Linearity of Determinants) Let A e M nxn (F),A = [ay]. Fora 
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fixed row s, suppose that a S j = b S j + c sj for each ) E [l;n]. Then 


det 


a (s-1)1 a ( s — 1 ) 2 

bsl + C s1 ^s2 + c s2 
a (s + 1)1 a (s + 1)2 


det 


a 1 1 
a 2 1 


1 ( s — 1 ) 1 

b R l 


a 1 2 
a 22 


‘( 5 - 1)2 

b s 2 


x (s + 1)1 a ( s + 1 ) 2 


a 1 1 a l 2 

a 2 1 a 22 


+ det 


l (S-l )1 
c sl 


1 (S-1 )2 
c s 2 


x (s + 1)1 a ( s + 1 ) 2 


“In 

a 2n 


a (s-1)n 
b s n + c s n 


l (s + l ) 


“In 

a 2n 


l (s-l )n 
b sn 

l (s + l )n 


“In 

“2n 


l (s-1 ) 
c s n 

1 ( s + 1 ) 


An analogous result holds for columns. 


Proof: Pat 


S = 


u i 

*21 


l (s + l )1 


a 1 2 
a 22 

a (s-l )2 
3 s 2 + c s 2 
a (s + 1 )2 


“in 

“2n 


(s-1 )n 
b s n + c s n 
a (s + l ) n 


a ^ ^ 
a 2 1 


a 1 2 
a 22 


a (s-l)1 a ( s — 1 ) 2 
b sl b s 2 

a (s + l)1 a ( s + 1 ) 2 


“In 

“2n 


x (s-1 ) 
b s n 
x (s + 1 ) 


“n 1 
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and 


Then 


“11 “12 ' ' ' “In 

a 2 1 a 22 ’ • • a 2n 


U = 


“(si ) 1 a ( s — 1 ) 2 
c s 1 c s 2 

a (s + 1 )1 a (s + 1 ) 2 


a (s-l )n 
c sn 

a (s + l ) n 


a n1 a n2 


a nn 


det S = YL ctG S n s S n ( °") tr( 1 ) ^2a (2) ***d(s — 1 ) cr ( s — 1 ) ( t>s cr ( s ) 

"^^sals) )^(s + 1 )a(s + 1 ) **’ cr ( n ) 

= XI ctG S n s S n ( °") a 1 cr ( 1 ) **2cr(2) s — 1 ) <r ( s — 1 ) ct(s )d(s + 1)cr(s + 1) * * * ^nafnj 

^ cxG S n °") cr( 1 ) 0-2 g( 2) * * * ^-(s~l)cr(s — 1) ^sir(s) ^(s + IJcrts + l) * * * Cl-ncr (n) 

= det T + det U. 

By applying the above argument to A T , we obtain the result for columns. 

□ 


293 Lemma If two rows or two columns of A 6 M nxn (F), A = [a^] are identical, then det A = Op. 


Proof: Suppose a S j = a t j for s f t and for all ) G [1; n]. In particular, this means that for any 

(Tg S„ we have a scr (t) = a to -(t} and a tcr ( S ) = a scr ( S ). Letr be the transposition 


X = 


S t 

T(t) x(s) 


Then crr(a) = cr(a) for a 6 {1 , 2, . . . , n} \ {s, t}. Also, sgn(o-x) = sgn(cr)sgn(x) = — sgn(cr). As 
a runs through all even permutations, err runs through all odd permutations, and viceversa. 
Therefore 


detA 


y~ cr£Su ®§ri( a) 0-1 <x(1) 0-2(7 [2] ' ' * Cls<T(s) * * * O-tcrf t) * * * ^nerf n) 

H °-es„ (sgn(a)ai CT (i)a2 CT (2) •• • O-scr(s) * * * * * * ^nff(n] 

sgn(cr)=1 

+ Sgn(CTT)ai <tt( 1 ) a 2crT{2) • • • Qsctt(s) ■ ' ' a tCTT(t) " ' ‘ dncTTfn)) 

L . 6 S„ Sgn( & ) ( cr ( 1 ) 0-2 cr ( 2 ) * * * cr ( s ) * * * cr ( t ) * * * cr ( n ) 

sgn(cr)=1 

cr(1 ) 0-2o-(2) * * * &scr(t) * * * &tcr(s) * * * ^ng(n)) 

Y ^s n Sgn(cj) (a-|o.(-i)Cl 2 o-( 2 ) • * * &scr(s ) ^tcr(t) * Clncr(n) 

sgn( cr ) = 1 

0-1 cr ( 1 ) 0-2 (j(2) * ' * Q-t cr ( t ) ' ' " a S cr ( s ) ' * * Q-ncr ( n ) ) 

Op. 


Arguing on A T will yield the analogous result for the columns. □ 
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294 Corollary If two rows or two columns of A € M nxn (F), A = [ay] are proportional, then det A = Op. 

Proof: Suppose a S j = aa t j for s f t and for all j G D ; n]. If B is the matrix obtained by pulling 

out the factor a from the s-th row then det A = a det B. But now the s-th and the t-th rows in B 
are identical, and so det B = Op. Arguing on A T will yield the analogous result for the columns. 

□ 

295 Example 



1 

a 

b 


1 

1 

b 

det 

1 

a 

c 

= a det 

1 

1 

c 


1 

a 

d 


1 

1 

d 


since on the last determinant the first two columns are identical. 


296 Theorem (Multilinearity of Determinants) Let A e M nxn (F), A = [ap] and a e F. If X e M nxn (F), X = 
(xp) is the matrix obtained by the row transvection R s + aR t — > R s then detX = det A. Similarly, 
if Y G M nxn (F), Y = (yp) is the matrix obtained by the column transvection C s + aC t — > C s then 

det Y = det A. 


Proof: For the row transvection it suffices to take b S j = a S j , c s j = au t j forj G [1 ; n] in Lemma 

292. With the same notation as in the lemma, T = A, and so, 

det X = det T + det U = det A + det U. 

But U has its s-th and t-th rows proportional (s =f tj, and so by Corollary 294 detU = Op. 
Hence detX = det A. To obtain the result for column transvections it suffices now to also apply 
Theorem 285. □ 

297 Example Demonstrate, without actually calculating the determinant that 


2 9 9 


det 


4 


6 


8 


7 4 1 


is divisible by 13. 

Solution: ► Observe that 299, 468 and 741 are all divisible by 13. Thus 


2 9 9 


2 9 299 


1 

K> 

vO 

K> 

U> 

i 


C 3 + 10C 2 + 100Ci->C 3 , 




4 6 8 

det 

4 6 468 

= 13 det 

4 6 36 

1 

i 


7 4 741 


7 4 57 


which shews that the determinant is divisible by 13. ◄ 

298 Theorem The determinant of a triangular matrix (upper or lower) is the product of its diagonal 
elements. 


Determinants 
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Proof: Let A 6 M nxn (F), A = [cty] be a triangular matrix. Observe that if a f Id then 

a io-(i)Q<T(i)cr 2 (i) = 0 F occurs in the product 


Thus 


□ 


^1 cr(l ) ^-2cr(2) * * * O'ncr(n) • 


dot A ^(x£S n ^ ) ^-1 ct( 1 ) 0-2(7 (2) * * * Q'no'(n) 


= Sgnfld )and (1 ) a 2 Id (2) • • • O n id (n) = dl 1 <*22 • • • Onn- 


299 Example The determinant of the n x n identity matrix I n over a field F is 


detl n = 1 F . 


300 Example Find 


1 2 3 


det 


4 5 6- 


7 8 9 


Solution: ► We have 



1 

2 

3 

C 2 -2C!->C 2 


1 

0 

0 

det 

4 

5 

6 

C3— 3Ci — > C3 

' — > 

det 

4 

-3 

-6 


7 

8 

9 



7 

-6 

-12 


1 0 0 


(-3) (-6) det 


4 


1 


7 2 2 


0, 


since in this last matrix the second and third columns are identical and so Lemma 293 applies. 

◄ 


301 Theorem Let (A,B) e (M nxn (F)) 2 . Then 

det(AB) = (det A) (det B). 


Proof: Put D = AB,D = ( d V) ) , dg = ££ =1 a ik b kj . If A (c:k) , D( C:k ),1 < k < n denote the 

columns of A and D . respectively, observe that 

n 

D(c:k) = bikA( C ; V ), 1 < k < n. 

1=1 
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Applying Corollary 288 and Lemma 292 multiple times, we obtain 

det D = det ( D ( c: i j , D( c: 2 ) , . . . , D( c:n ) ) 

• det ( A( C:j 1 ) > A( c :)2 ) > • ■ • 1 A( c:jn ) ) ■ 

By Lemma 293, if any two of the A( C:jl ) are identical, the determinant on the right vanishes. So 
each one of the ji is different in the non-vanishing terms and so the map 

{1,2,...,n} -> {1 , 2, . . . , n} 
cr : 

l i— > 

is a permutation. Here j i = cr ( l ) . Therefore, for the non-vanishing 

det(A( c; j 1 ) , A( C: j 2 ) , . . . , A( C: j n ) ) 
we have in view of Corollary 284, 

det ( A( c: j , ) , A( c: j 2 A( c; j n ) ) = ( sgn ( cr) ) det f A( c:i ) > A( C: 2 ) > • ■ . > A( c:tV ) ) 

= (sgn(cr)) det A. 

We deduce that 

det(AB) = detD 

= ^j n =i ii ^2j 2 ' ' ' bnj n detfAfc^, } > A( C: j 2 ) , . . . , A( C; j n ) ) 

= (det A) Laes n ( s g n (°')) b iTmt> 2 o-( 2 ) •••b nff(n ) 

= (det A) (det B), 

as we wanted to shew. □ 

By applying the preceding theorem multiple times we obtain 

302 Corollary If A 6 M nxn (F) and if k is a positive integer then 

det A k = (det A) k . 

303 Corollary If A E GL n (F) and if k is a positive integer then det A =f Of and 

detA k = (detA) k . 

Proof: We have AA 1 = I n and so by Theorem 301 (det A) (det A 1 ) = 1 F , from where the 

result follows. □ 


Homework 


Problem 6.3.1 Let 

be ca ab 
a b c 
a 2 b 2 c 2 


a = det 


Laplace Expansion 
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Without expanding either determinant, prove that 


n = det 


1 1 1 


Problem 6.3.2 Demonstrate that 


£1 = det 


a — b — c 
2b 
2c 


2a 

b — c — a 
2c 


2a 

2b 

c — a — b 


= (a + b + c) 


Problem 6.3.3 After the indicated column operations on a 3 x 3 matrix A with det A = —540, matrices Ai , Ai , . . . , As 
are successively obtained: 


_ C -| + 3 C 2 — ^ C -j C, 2 t - * 3 ^ 5 l _2 — i_-| — > l_ 2 

/V — T 


_ C2^C3 _ 3C-2 — Ci — ^C2 * Ci — 3C.7 — >Cr „ 2Ci — ) C 1 _ 

A^ -> A 2 -f A 3 -f A 4 -> A 5 


Determine the numerical values of det Ai , det A2 , det A3 , det A 4 and det A5 . 
Problem 6.3.4 Prove, without actually expanding the determinant, that 

1 2 3 7 0 


det 


6 1 5 14 1 

8 6 1 21 3 

7 3 8 7 1 

2 4 6 0 4 


is divisible by 1722. 


Problem 6.3.5 Let A, B, C be 3 x 3 matrices with det A = 3,detB 3 = — 8,detC = 2. Compute (i) det ABC, (ii) 
det 5 AC, (iii) det A 3 B 3 C 1 . Express your answers as fractions. 

Problem 6.3.6 Shew that V A G M n xn(R), 

3(X,Y) G (Mnxn(R)) 2 , (detX)(detY) ^0 

such that 

A = X + Y. 

That is, any square matrix over R can be written as a sum of two matrices whose determinant is not zero. 

Problem 6.3.7 Prove or disprove! The set X = {A G M n xn(F) : det A = Of} is a vector subspace of M nX n(F). 

6.4 Laplace Expansion 

We now develop a more computationally convenient approach to determinants. 


Ctj = (sgn(ff) )ai CT (i ) cl 2(T [ 2 ) • • • a-n 


o - £ S tv 

cr ( t) =i 


Put 
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Then 


det A ^ ,j^ 5 n ( Sgll( O') ) 0-1 cx( 1 ) 0-2 (t( 2) * * ' O no -( Tl ] 

= zr^tiL cjG S n (sgn(CT))a 1o .(i)a2o-(2) 
o-(i)=j 

* * * ) cr(i— l ) 0(i+l ) cr(i+l ) * * * ^no-(r 

= Li=l a ijCtj, 

is the expansion of det A along the j-th column. Similarly, 

detA = Lo-eS n ( S g n (°')) a 1o-(1) a 2<T(2) ••• Ono-(n) 

= ZX=i a ij Y. »esn (sgn(o-))a ltJ (-i)a2 CT (2) 

cr(i)-j 

' * * 0(t_i ] cr(i— 1 ) O , i ■ 1 ] cr(i+l ; ' ' ' 

= Xp = l OijCtj, 

is the expansion of det A along the i-th row. 

304 Definition Let A E M nxn (F), A = [dp]. The ij-th minor Ap E M n _i I 
obtained by deleting the i-th row and the j-th column from A. 


is the (n — 1 ) > 


305 Example If 


A = 


1 2 3 
4 5 6 
7 8 9 


then, for example. 



5 6 


4 6 


2 3 


1 3 


An = 


> Ai2 = 


, A21 = 


, A22 = 


, A33 = 


8 9 


7 9 


8 9 


7 9 



306 Theorem Let A <E M nxn (F). Then 


detA = 22 ap(— l) l+j detAp = 22 Op(— 1) 1+j detAp. 


i=l 


j = 1 


Proof: It is enough to shew, in view of 6.5 that 

(-1) i+i detAp = Cp. 

Now, 

Cnn y o'GSn Sgn(o-)a lo . (11 a 2 CT ( 2 ) • • • 0(n-1 )<t(ti -1 ) 

o'(n)=n 

= L. e s n _, S S n ( T ) a lx(1)02T(2) • • • 0 (tl-1 )x(n-1 ) 

= det A ii u , 


(6.5) 


(n — 1 ) matrix 


2 

5 
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since the second sum shewn is the determinant of the submatrix obtained by deleting the last 
row and last column from A. 

To find Cij for general ij we perform some row and column interchanges to A in order to bring ay 
to the nn-th position. We thus bring the i-th row to the n-th row by a series of transpositions, first 
swapping the i-th and the (i+1 )-th row, then swapping the new (i+ 1 )-th row and the (i+2 )-th 
row, and so forth until the original i-th row makes it to the n-th row. We have made thereby 
n — i interchanges. To this new matrix we perform analogous interchanges to the ) -th column, 
thereby making n — j interchanges. We have made a total of 2n — i — j interchanges. Observe 
that (— = (— 1) 1+ h Call the analogous quantities in the resulting matrix A' , C^ n ,A^ n . 
Then 

Cij = c; n = det A; n = det At,, 

by virtue of Corollary 284. 

□ 


It is irrelevant which row or column we choose to expand a determinant of a square matrix. 
We always obtain the same result. The sign pattern is given by 


307 Example Find 


by expanding along the first row. 

Solution: ► We have 


1 2 3 


det 


4 5 


6 


7 8 9 


det A 


5 6 


4 6 


4 5 


+ 2(— 1) 1+2 det 


+ 3(— 1) 1+3 det 


8 9 


7 9 


7 8 


1 (45 - 48) - 2(36 - 42) + 3(32 - 35) = 0. 


◄ 

308 Example Evaluate the Vandermonde determinant 


det 


1 1 1 

a b c 


b 2 c 2 
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Solution: ► 



1 

1 

1 


1 0 0 

det 

a 

b 

c 

= det 

a b — a c — a 


a 2 

b 2 

c 2 


a 2 b 2 — a 2 c 2 — a 2 


det 


b — a c — a 
b 2 — c 2 c 2 — a 2 

1 1 
b + a c + a 

(b — a)(c — a)(c — b). 


= (b — a)(c — a) det 


309 Example Evaluate the determinant 


det 



1 

2 


2 

1 


3 

2 

det 

4 

3 


2000 

1999 

Tt+l 

— > R n 

VI 

T — 

u . 

-1 

1 

1 

-1 

-1 

1 

-1 

-1 

-1 

-1 

-1 

-1 

-1 

-1 

-1 


3 4 

2 3 

1 2 

2 1 


2000 

1999 

1998 

1997 


Solution: ► Applying R n — R n+ i — > R n for 1 < n < 1999, the determinant becomes 


2000 1999 1998 1997 


1 1 
1 1 
1 1 
1 1 

1 1 
2 1 
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Applying now C n + C 2000 — » C n for 1 < n < 1 999, we obtain 


det 


0 

0 

0 

0 


2 2 2 
0 2 2 
0 0 2 
0 0 0 


2 1 
2 1 
2 1 
2 1 


0 0 0 0 
2001 2000 1999 1998 


0 

3 


This last determinant we expand along the first column. We have 


2001 det 


2 

2 

2 •• 

• 2 

1 

0 

2 

2 •• 

• 2 

1 

0 

0 

2 •• 

• 2 

1 

0 

0 

0 •• 

• 2 

1 

0 

0 

0 •• 

• 0 

1 


= 2001(2 


1998-1 


◄ 

310 Definition Let A g M nxn (F). The classical adjoint or adjugate of A is the n x n matrix adj (A) whose 
entries are given by 

[adj (A)]tj = (— 1) i+j detAji, 

where Ajt is the ji-th minor of A. 


311 Theorem Let A e M nxn (F). Then 

(adj (A)) A = A(adj (A)) = (det A)I n . 


Proof: We have 


[A(adj(A))] t , = £Li a ik [adj (A)] k , 

= L£=i a tk (-1) t+k detA jk . 

Now, this last sum is det A i/' i = j by virtue of Theorem 306. Ifx -f) it is 0, since then the j -th 
row is identical to the i -th row and this determinant is Of by virtue of Lemma 293. Thus on the 
diagonal entries we get det A and the off-diagonal entries are Op. This proves the theorem. □ 

The next corollary follows immediately. 

312 Corollary Let A E M nxn (F). Then A is invertible if and only det A f Op and 

A _-, = adj (A) 

det A 
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Homework 


Problem 6.4.1 Find 


det 


1 2 3 
4 5 6 
7 8 9 


by expanding along the second column. 

a b c 

Problem 6.4.2 Prove that det c a b 

b c a 

Problem 6.4.3 Compute the determinant 


= a 3 + b 3 + c 3 — 3abc. This type of matrix is called a circulant matrix. 


det 


10-1 1 

2 0 0 1 

666 -3 -1 1000000 

10 0 1 


Problem 6.4.4 Prove that 


det 


x + a b 

a x + b 
a b 


= x (x + a + b + c) 


Problem 6.4.5 If 


det 


0, 


and xabc yO, prove that 


Problem 6.4.6 Consider the matrix 


1111 

x a 0 0 
x 0 b 0 
x 0 0 c 


1111 

— F TT d • 

x a b c 


a — b — c — d 

bad — c 
c — d a b 

d c — b a 


Laplace Expansion 
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O Compute A T A. 

© Use the above to prove that 


Problem 6.4.7 Prove that 


det 


det A = (a 2 + b 2 + c 2 + d 2 ) 2 . 


0 a b 0 
a 0 b 0 
0 a 0 b 
1111 


Problem 6.4.8 Demonstrate that 


det 


det 


2abfa — b) 


a 0 b 0 
0 a 0 b 
c 0 d 0 
0 c 0 d 


= (ad — be) 


Problem 6.4.9 Use induction to shew that 

1 1 1 

1 0 0 

0 1 0 

0 0 1 

0 0 0 


1 1 

0 0 
0 0 
0 0 

1 0 


= (- 1 ) 


n+1 


Problem 6.4.10 Let 


1 n n n 

n 2 n n 

n n 3 n 

n n n 4 


n 


n 


n n n n n n 

that is, A g M n xn(R), A = [ap] is a matrix such that a^k = k and ap = n when i =/ j. Find det A. 

Problem 6.4.11 Let n g N,n > 1 be an odd integer. Recall that the binomial coefficients ('’) satisfy 
and that for 1 < k < n. 


/n\ /n\ 
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Prove that 


det 


1 1 


\n — 2/ \n— 1 > 


\n-3j \n — 2/ 


= (i + HD T 


Problem 6.4.12 Let A G GL n (F), n > 1. Prove that det(adj (A)) = (det A) n 1 . 

Problem 6.4.13 Let (A, B,S) G (GL n (F)) 3 . Prove that 
O adj (adj (A)) = (det A) n ~ 2 A. 

© adj (AB) = adj (A) adj (B). 

© adj (SAS- 1 ) = S(adj (A))S _1 . 


Problem 6.4.14 If A G GL 2 (F), , and let k be a positive integer. Prove that det (adj • ■ ■ adj (A) ) = det A. 


Problem 6.4.15 Find the determinant 


det 


(b + c) 2 

ab 

ac 

ab 

(a + c) 2 

be 

ac 

be 

(a + b) 


by hand, making explicit all your calculations. 

Problem 6.4.16 The matrix 

abed 
d a b c 
c d a b 
b c d a 

is known as a circulant matrix. Prove that its determinant is (a + b + c + d)(a — b + c — d)((a — c) 2 + (b — d) 2 ) 


6.5 Determinants and Linear Systems 

313 Theorem Let A G M nxn (F). The following are all equivalent 
O det A 7/ Of- 
0 A is invertible. 

© There exists a unique solution X G M nx i (F) to the equation AX = Y. 
O If AX = O nx i then X = O nx i . 


Determinants and Linear Systems 
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Proof: We prove the implications in sequence: 

O => ©: follows from Corollary 312 

© =$. ©; if A is invertible and AX = Y then X = A 1 Y is the unique solution of this equation. 

© => Q: follows by putting Y = 0 nx i 

© => ©.' Let R be the row echelon form of A. Since RX = 0 nx i has only X = 0 nx i as a solution, 
every entry on the diagonal of R must be non-zero, R must be triangular, and hence det R / CL. 
Since A = PR where P is an invertible n x n matrix, we deduce that det A = det P det R =/ CL. 

□ 

The contrapositive form of the implications O and © will be used later. Here it is for future reference. 
314 Corollary Let A € M nxn (F). If there is X f 0 nx i such that AX = 0 nx i then det A = Op. 


Homework 


-i i i 


Problem 6.5.1 For which a is the matrix -j 


singular (non-invertible)? 


1 1 a 


Chapter 

Eigenvalues and Eigenvectors 

7.1 Similar Matrices 

315 Definition We say that A E M nxn (F) is similar to B E M nxn (F) if there exist a matrix P E GL n (F) 
such that 

B = PAP 1 . 


7 


3 1 6 Theorem Similarity is an equivalence relation. 


Proof: Let A E M nxn (F). Then A = I n AI n 1 , so similarity is reflexive. If B = PAP -1 (P E 

GL n (F) ) then A = P 1 B P so similarity is symmetric. Finally, if B = PAP -1 and C = QBQ 1 
fP E GL n (F) , Q E GL n (F)J then C = QPAP 1 Q 1 = QPA(QP) 1 and so similarity is transitive. 

□ 

Since similarity is an equivalence relation, it partitions the set of n x n matrices into equivalence classes 
by Theorem 29. 


317 Definition A matrix is said to be diagonalisable if it is similar to a diagonal matrix. 


Suppose that 


Ai 0 0 •• • 0 

0 A 2 0 • • • 0 


Then if K is a positive integer 


0 0 0 • • • A n 

A* 0 0 • • • 0 


a k 


0 A* 0 


0 


0 0 0 • • • a£ 
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In particular, if B is similar to A then 


B k = (PAP 1 ) (PAP 1 ) • • • (PAP 1 ) = PA K P 1 = P 


K factors 




0 0 


0 a 2 k 0 


0 0 0 


0 

0 


p \ 


so we have a simpler way of computing B K . Our task will now be to establish when a particular square 
matrix is diagonalisable. 


7.2 Eigenvalues and Eigenvectors 

Let A E M nxn (F) be a square diagonalisable matrix. Then there exist P E GL n (F) and a diagonal matrix 
D E M nxn (F) such that P 1 AP = D, whence AP = DP. Put 


D = 


, P = [Pi ; P 2 ; • • • ; Pn], 


Ai 0 0 •• • 0 

0 A 2 0 • • • 0 

0 0 0 • • • A n 

where the P k are the columns of P. Then 

AP = DP =£• [APi;AP 2 ;- • • ;AP n ] = [AiPi;A 2 P 2 ;- • • ;A n P n ], 
from where it follows that AP k = A k P k . This motivates the following definition. 


318 Definition Let V be a finite-dimensional vector space over a field F and let T : V -> V be a linear 
transformation. A scalar A E F is called an eigenvalue of T if there is a v f 0 (called an eigenvector) 
such that T(T/) = Av\ 

319 Example Shew that if A is an eigenvalue of T : V — > V, then A k is an eigenvalue of T k : V — » V, for 
k € N \ {0}. 


Solution: ► Assume that T(v) = Av. Then 

T 2 (T/) = TT(T/ ) = T(Av) = AT(T/) = A(Av) = A 2 T/. 

Continuing the iterations we obtain T k (A? J = A k V, which is what we want. ◄ 

320 Theorem Let A E M nxn (F) be the matrix representation of T : V — > V. Then A E F is an eigenvalue 
of T if an only if det(AI n — A) = Op. 

Proof: A is an eigenvalue of A E=4 there is F such that A\t = AA? A V* — AT/ = ~0 

AI n T/ — AT/ = 0 det(AI n — A) = Op by Corollary 314.0 


321 Definition The equation 


det(AI n — A) = Op 

is called the characteristic equation of A or secular equation of A. The polynomial p(A) 
the characteristic polynomial of A. 


det(AI n 


A) is 
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322 Example Let A 


110 0 
110 0 
0 0 11 
0 0 11 


Find 


O The characteristic polynomial of A. 
© The eigenvalues of A. 

© The corresponding eigenvectors. 


Solution: ► We have 


O 


det(AI 4 



A- 1 

-1 

0 

0 





-1 

A- 1 

0 

0 




= det 









0 

0 

A - 1 

-1 





0 

0 

-1 

A- 1 






A- 1 

0 

0 


-1 

0 0 

= (A- 

1 ) det 

0 

A- 1 

-1 

+ det 

0 

A - 1 -1 



0 

-1 

A - 1 


0 

-1 A - 1 

= (A- 

1)((A — 

1 X (A - 

-I) 2 - 

1)) + 

(—((A — 

I) 2 

-1)) 

= (A- 

1)((A — 

-1)(A — 

2) (A)) 

- (A- 

-2)(A) 



= (A- 

2) (A) ((A — 1 ) 2 

-1) 






= (A — 2) 2 (A) 2 


© The eigenvalues are clearly A = 0 and A = 2. 
© If A = 0, then 


- 1-10 0 


0I 4 - A 


- 1-10 0 
0 0 - 1-1 


0 


0 


-1 


-1 
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This matrix has row-echelon form 

- 1-10 0 
0 0 - 1-1 
0 0 0 0 

0 0 0 0 

and if 


1 

1 

0 

0 


a 


0 

T 

T 

0 

0 


b 


0 

0000 


c 


0 

0000 


d 


0 


then c = — d and a = — b 


Thus the general solution of the system (OI4 — A)X = O nx i is 


If A = 2, then 


a 


1 


0 

b 

= a 

-1 

+ C 

0 

c 


0 


1 

d 


0 


-1 



1 

1 

0 

A 


-1 

1 

0 



0 

0 

1 



0 

0 

-1 


0 

0 

-1 

1 


This matrix has row-echelon form 


-110 0 
0 0 1-1 
0 0 0 0 

0 0 0 0 


and if 
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1-10 0 


a 


0 

0 0-11 


b 


0 

0 0 0 0 


c 


0 

0 0 0 0 


d 


0 


then c = d and a = b 

Thus the general solution of the system (2I4 — A)X = O nx i is 


a 


1 


0 

b 

= a 

1 

+ c 

0 

c 


0 


1 

d 


0 


1 


Thus for A = 0 we have the eigenvectors 


1 


0 

-1 


0 

0 

) 

1 

0 


-1 


and for A = 2 we have the eigenvectors 


1 


0 

1 


0 

0 

> 

1 

0 


1 


323 Theorem If A = Of is an eigenvalue of A, then A is non-invertible. 

Proof: Put p(A) = det(AI n — A). Then p(0p) = det(— A) = (—1 ) n det A is the constant term of 

the characteristic polynomial. If A = Oh is an eigenvalue then 

p(0p) = Of => detA = 0p, 

and hence A is non-invertible by Theorem 313. □ 

324 Theorem Similar matrices have the same characteristic polynomial. 


Diagonalisability 
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Proof: We have 


det(AI n - SAS- 1 ) 


from where the result follows. □ 


det(ASI n S 1 - SAS- 1 ) 

det S(AI n — A)S _1 

(det S)(det(AI n — A)) (det S _1 ) 

(detS)(det(AIn-A)) 

det (Ain - A), 


Homework 


Problem 7.2.1 Find the eigenvalues and eigenvectors of 

1 -1 

-1 1 

Problem 7.2.2 Let A be a 2 x 2 matrix over some some 
field F. Prove that the characteristic polynomial of A is 

A 2 — (tr (A))A + det A. 

Problem 7.2.3 A matrix A £ M 2 X 2 (R) satisfies tr (A) = 
—1 and det A = —6. Find the value of det(l 2 + A). 

Problem 7.2.4 A 2 x 2 matrix A with real entries has 
characteristic polynomial p(A) = A 2 + 2 A — 1 . Find the 
value of det (212 + A). 


Problem 7.2.5 Let A 


0 

2 

-1 


-1 

-2 

0 


. Find 


O The characteristic polynomial of A. 

© The eigenvalues of A. 

© The corresponding eigenvectors. 

Problem 7.2.6 Describe all matrices A £ M 2 X 2 ( 
ing eigenvalues 1 and —1 . 


hav- 


Problem 7.2.7 Let A £ M nxu (R). Demonstrate that A 
has the same characteristic polynomial as its transpose. 


7.3 Diagonalisability 

In this section we find conditions for diagonalisability. 

325 Theorem Let {am ,f>2, ■ ■ ■ ,Vk} C V be the eigenvectors corresponding to the different eigenvalues 
(Ai , A2, . . . , Aic} (in that order). Then these eigenvectors are linearly independent. 

Proof: Let T : V — 1 V be the underlying linear transformation. We proceed by induction. 

For k = 1 the result is clear. Assume that every set of k — 1 eigenvectors that correspond 
to k — 1 distinct eigenvalues is linearly independent and let the eigenvalues Ai , A2, . . . , A k _i 
have corresponding eigenvectors v?i, V2,...,"^k-i- Let A be a eigenvalue different from the 
Ai , A2, . . . , Aic_i and let its corresponding eigenvector be V. Ifv were linearly dependent of the 
,^2, • • • ,Vk-i . we would have 

XV + xtVi + X2V2 + • • • + Xk-iVk-i ="(?. (7.1) 


Now 

T(xv +X1V1 +X2V2 
by Theorem 240 . This implies that 


Xk-iV k _i) =T(~^) =~0, 


xAv +X1A1 V] +X2A2V2 


Xk-i A k -i v k _i = 0 . 


(7.2) 
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From 7.2 take away A times 7.1, obtaining 

(Ai — A)^i + %2(A2^2 + • • • + %k-i (Aic_i — A^k-i = 0 (7.3) 

Since A — A t y Of 7.3 is saying that the eigenvectors V ^ , ~\} 2} ■ ■ ■ ,^k-i ewe linearly dependent, 
a contradiction. Thus the claim follows for k distinct eigenvalues and the result is proven by 
induction. □ 


D 


such that 


326 Theorem A matrix A E M nXTl (F) is diagonalisable if and only if it possesses n linearly independent 
eigenvectors. 

Proof: Assume first that A is diagonalisable, so there exists P E GL n (F) and 


Ai 0 0 •• • 0 

0 A 2 0 • • • 0 

0 0 0 • • • A n 

Ai 0 0 •• • 0 

0 A 2 0 • • • 0 

0 0 0 • • • An 

At 0 0 •• • 0 

0 A 2 0 • • • 0 

0 0 0 • • • A n 


P 7 ap = 


Then 


[APt ; AP 2 ; • • • ;AP n ] = AP = P 


= [A t P t ; A2P2; ' • • ; A n Pn], 


where the P k are the columns o/P. Since P is invertible, the P k are linearly independent by 
virtue of Theorems 204 and 313. 

Conversely, suppose now that V 1 , . . . , v' n are n linearly independent eigenvectors, with corre- 
sponding eigenvalues Ai , A 2 , . . . , A n . Put 


P = [T?i;...;^ n ], D = 


At 0 0 

0 A 2 0 

0 0 0 


Diagonalisability 
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Since A = Ao?t we see that AP = PD. Again P is invertible by Theorems 204 and 313 since 
the V k are linearly independent Left multiplying by P 1 we deduce P 1 AP = D,/rom where A 
is diagonalisable. □ 

327 Example Shew that the following matrix is diagonalisable: 

1 -1 -1 
1 3 1 

-3 1 -1 

and find a diagonal matrix D and an invertible matrix P such that 

A = PDP 1 . 


Solution: ► Verify that the characteristic polynomial of A is 

A 3 - 3A 2 -4A + 12 = (A - 2) (A + 2) (A - 3). 
The eigenvector for A = —2 is 

1 

-1 • 

4 

The eigenvector for A = 2 is 

-1 
0 ■ 

1 

The eigenvector for A = 3 is 

-1 
1 ■ 

1 


We may take 


We also find 


◄ 


-2 

0 

0 


1 

-1 

4 

0 

2 

0 

, P = 


0 

1 

0 

0 

3 


-1 

1 

1 


1 -1 1 

5 5 

0 -1 1 


1 0 1 

5 u 5 
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Homework 


Problem 7.3.1 Let A be a 2 x 2 matrix with eigenval- 

1 


ues 1 and —2 and corresponding eigenvectors 


and 


1 

-1 


respectively. Determine A 1 


Problem 7.3.2 Consider the matrix A 


9 -4 

20 -9 


1. Find the characteristic polynomial of A. 

2. Find the eigenvalues of A. 

3. Find the eigenvectors of A. 


4. II A 1 


a b 
c d 


find a + d. 


Problem 7.3.3 Let A g M 3 X 3 (R) have characteristic 
polynomial 


(A + 1) z (A-3). 


One of the eigenvalues has two eigenvectors 


1 

0 and 
0 


. The other eigenvalue has corresponding eigenvec- 


1 
1 

0 j 

1 

tor ■) 

1 

Problem 7.3.4 Let 


. Determine A. 


0 0 0 1 
0 0 10 
0 10 0 
10 0 0 


1. Find det A. 

2. Find A- 1 . 

3. Find rank ( A — I 4 ) . 

4. Find det(A -I 4 ). 

5. Find the characteristic polynomial of A. 

6. Find the eigenvalues of A. 

7. Find the eigenvectors of A. 

8. Find A 10 . 

Problem 7.3.5 Consider the matrix 


1 a 1 

A = 0 1 b 

0 0c 


O Find the characteristic polynomial of A. 

© Explain whether A is diagonalisable when a = 
0, c = 1. 

© Explain whether A is diagonalisable when a =/ 
0, c - 1. 

© Explain whether A is diagonalisable when c y" 1 . 


Problem 7.3.6 Find a closed formula for A n , if 


A = 


-7 -6 

12 10 


Problem 7.3.7 Let U g M nX n 

whose entries are equal to 1 . 

1. Demonstrate that U 2 = nil. 

2. Find detll. 

3. Prove that det(AI n — 11) = A n 1 (A — n). 

4. Shew that dim ker (U) = n — 1 . 

5. Shew that 


be a square matrix all 


U = P 


n 0 
0 0 

0 0 


p \ 


Theorem of Cayley and Hamilton 
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where 

11 0 ••• 0 0 

10 1 ••• 0 0 

100-.: : 

P = 

10 0 ••• 0 1 

1 _1 _1 ... _1 


7.4 Theorem of Cayley and Hamilton 

328 Theorem (Cayley-Hamilton) A matrix A 6 M n (F) satisfies its characteristic polynomial. 

Proof: Put B = AI n — A. We can write 

det B = det(AI n — A) = A n + bi A n_1 + b 2 A n ~ 2 H h b n , 

as det(AI n — A) is a polynomial of degree n. 

Since adj ( B) is a matrix obtained by using (n — 1 ) x (n — 1 ) determinants from B, we may write 

adj (B) = A n 1 B n i +A n 2 B n 2 + --- + B 0 . 

Hence 


det(AI n -A)I n = (B)(adj(B)) = (AI n — A) (adj (B)), 

from where 

A n I n + bTl n A n 1 + b2lnA n 2 + • • • + bnln = (AIn — A) (A n ^n-i+A' 1 2 B n 2 + • • • + Bo ) . 
By equating coefficients we deduce 


In = B n _i 

hi In = A 11 u ] + B n 2 

b 2 In = — ABn 2 + B n 3 


b n — 1 1n = — ABi + Bo 
bnln = 


AB 0 . 
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Multiply now the k-th row by A n k (the first row appearing is really the O-th row): 

A n = A n B n _i 

brA 1 "- 1 = -A n B n _ 1 +A n - 1 B n _ 2 

b 2 A n - 2 = -A n - 1 B n _ 2 +A n - 2 B n _ 3 


b n _i A = — A 2 Bi + ABo 
bnln ABq. 


Add all the rows and through telescopic cancellation obtain 

A n + bi A n ^ + • • • + b n _i A + b n I n = O n , 

from where the theorem follows. □ 

329 Example From example 327 the matrix 

1 -1 -1 
1 3 1 

-3 1 -1 

has characteristic polynomial 

(A -3) (A -2) (A + 2) = A 3 — 3A 2 — 4A + 1 2, 
hence the inverse of this matrix can be obtained by observing that 


A 3 — 3A 2 — 4A + 1 2 I 3 = O 3 A - 1 


-1 (A2 -3A- 4I 3) 


1/3 

1/6 

- 1/6 

1/6 

1/3 

1/6 

-5/6 

- 1/6 

-1/3 


Homework 


Problem 7.4.1 A 3 x 3 matrix A has characteristic polynomial A(A — 1 ) (A + 2). What is the characteristic polynomial 
of A 2 ? 


Chapter 


8 


Linear Algebra and Geometry 


8.1 Points and Bi-points in 


is the set of all points A = 


U,\ 


with real number coordinates on the plane, as in figure 8.1. We use 


the notation O = 




V7 


\ a2 J 

to denote the origin. 


A = 




\°v 


-f 

i 

i 


O 


Figure 8.1: Rectangular coordinates in R 2 . 



( \ 


/ \ 



ai 


bt 


Given A = 

v a2 y 

6 M 2 and B = 

W 

E I# 2 we define their addition as 


A + B = 

H 

+ 

H 


ai + bi 


\ ai j 


W 


^a 2 + b 2 j 
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Similarly, we define the scalar multiplication of a point of R 2 by the scalar a 6 R as 


aA = a 


/ N 


f \ 

ai 

_ 

aai 

V% 


\ aa 2j 


( 8 . 2 ) 


Throughout this chapter, unless otherwise noted, we will use the convention that a point 
A E R 2 will have its coordinates named after its letter, thus 


A = 




\ a v 


330 Definition Consider the points A 6 R 2 , B 6 R 2 . By the bi-point starting at A and ending at B, denoted 
by [A, B], we mean the directed line segment from A to B. We define 


[A, A] = O 


M 

W 


The bi-point [A, B] can be thus interpreted as an arrow starting at A and finishing, with 
the arrow tip, at B. We say that A is the tail of the bi-point [A, B] and that B is its head. Some 
authors use the terminology “fixed vector” instead of “bi-point.” 

331 Definition Let A f B be points on the plane and let L be the fine passing through A and B. The 
direction of the bi-point [A, B] is the direction of the fine L, that is, the angle 0 6 ] — ■J; j] that the line L 
makes with the horizontal. See figure 8.2. 


332 Definition Let A, B fie on line L, and let C, D lie on fine L'. If L||L' then we say that [A, B] has the 
same direction as [C, D]. We say that the bi-points [A, B] and [C, D] have the same sense if they have 
the same direction and if both their heads fie on the same half-plane made by the line joining their 
tails. They have opposite sense if they have the same direction and if both their heads fie on alternative 
half-planes made by the fine joining their tails. See figures 8.3 and 8.4 . 




Figure 8.2: Direction of a 
bi-point 


Figure 8.3: Bi-points 

with the same sense. 


Figure 8.4: Bi-points 

with opposite sense. 
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Bi-point [B, A] has the opposite sense of [A, B] and so we write 

[B, A] = —[A, B], 

333 Definition Let A^B. The Euclidean length or norm of bi-point [A, B] is simply the distance between 
A and B and it is denoted by 

l|[A, B]|| = y/(ai -b 1 ) 2 + (a 2 -b 2 ) 2 . 

We define 

ll[A, A]|| = ||0|| = 0. 

A bi-point is said to have unit length if it has norm 1 . 


A bi-point is completelg determined by three things: (i) its norm, (ii) its direction, and (iii) its 
sense. 

334 Definition (Chasles’ Rule) Two bi-points are said to be contiguous if one has as tail the head of the 
other. In such case we define the sum of contiguous bi-points [A, B] and [B, C] by Chasles’ Rule 

[A, B] + [B , C] = [A, C], 

See figure 8.5. 

335 Definition (Scalar Multiplication of Bi-points) Let A e M \ {0} and A f B. We define 

0[A, B] = O 

and 

A[A, A] = O. 

We define A[A, B] as follows. 

1. A[A, B] has the direction of [A, B]. 

2. A[A, B] has the sense of [A, B] if A > 0 and sense opposite [A, B] if A < 0. 

3. A[A,B] has norm |A|||[A,B]|| which is a contraction of [A, B] if 0 < |A| < 1 or a dilatation of [A,B] if 

|A| > 1 . 

See figure 8.6 for some examples. 


B 


A 



C 


-2[A, B] 



Figure 8.5: Chasles’ Rule. 


Figure 8.6: 
points. 


Scalar multiplication of bi- 
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8.2 Vectors in M 2 

336 Definition (Midpoint) Let A, B be points in K 2 . We define the midpoint of the bi-point [A, B] as 

^ a i +b i ^ 


A + B 




337 Definition (Equipollence) Two bi-points [X, Y] and [A, B] are said to be equipollent written [X, Y] 
[A, B] if the midpoints of the bi-points [X, B] and [Y, A] coincide, that is. 


[X, Y] ~ [A, B] & 


X+B Y+A 


See figure 8.7. 


Geometrfcalfy, equfpoflence means that the quadrflateraf XYBA fs a paralfefogram. Thus the bf-pofnts 
[X, Y] and [A, B] have the same norm, sense, and dfrectfon. 


Y 



Ffgure 8.7: Equfpolfent bf-pofnts. 


338 Lemma Two bf-pofnts [X, Y] and [A, B] are equfpolfent If and only If 


/ \ 


(u \ 

yi — 

= 

bi a-i 

v y 2 -x 2y 


^b 2 — a 2 j 


Proof: This is immediate, since 


[X, Y] ~ [A, B] 


^ ai +y ! ^ 


^bi +xi ^ 

2 

_ 

2 

ai+yz 

V 2 y 


b 2 +x 2 

V 2 / 

/ 

\ 

(u 

yi — X! 

bi - c 


\V2 -x 2 y 


v b 2 — a 2 j 


as desired. □ 

From Lemma 338, equipollent bi-points have the same norm, the same direction, and the 
same sense. 


339 Theorem Equipollence Is an equivalence relation. 
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Proof: Write [X, Y] ~ [A, B] if [X, Y] if equipollent to [A, B]. Now [X, Y] ~ [X, Y] since 


( \ 

yi — xi 


yy 2 -x 2 y 


( \ 

yi — xi 

^2 -* 2 ) 


and. so the relation is reflexive. Also 

( 


[X, Y] ~ [A, B] 


and the relation is symmetric. Finally 

[X, Y] ~ [A, B] A [A, B] ~ [U, V] 


/ 


[A, B] ~ [X, Y], 




^,-cd 

X2y 


yb 2 — Cl2y 

ai 


( \ 
yi xi 

ai > 


y y2 X 2 y 


( \ 


/, \ 

yi X"i 


bi - a-\ 

v y2 -x 2y 


^b 2 — a 2 j 


/ 


A 


\ 


bi - on 
\b 2 - a 2 j 


( 


\ 


Vi - U! 
\v 2 — au 2 J 


( \ 


f \ 

yi xi 


Vi — Ui 

v y2 -x 2y 


^V 2 - VL 2) 


[X,Y]~[U,V], 


and the relation is transitive. □ 

340 Definition (Vectors on the Plane) The equivalence class in which the bi-point [X, Y] falls is called the 
vector (or free vector) from X to Y, and is denoted by XY. Thus we write 


yi —xt 

y2 — X 2 

If we desire to talk about a vector without mentioning a bi-point representative, we write, say, V, thus 
denoting vectors with boldface lowercase letters. If it is necessary to mention the coordinates of V we 
will write 

vi 


[X, Y] 6 XY = 


V2 


} 

V 
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For any point X on the plane, we have X>t = , the zero vector. If[X,Y\ E 7 then [Y, X] E 

— v. 


341 Definition (Position Vector) For any particular point P 


L\ 


v 2 / 


E l 2 we may form the vector OP = 


Pi 


We call OP the position vector of P and we use boldface lowercase letters to denote the equality 


P2 

Ol^. 


342 Example The vector into which the bi-point with tail at A = 


(-') 


I 3 ) 


and head at B = 


\ 2 J 


W 



3- (-1) 


4 

AB = 

= 



4-2 


2 


falls is 


343 Example The bi-points [A, B] and [X, Y] with 


/ \ / \ 

-1 3 


A = 


> B = 


V 2 / 


x = 




VJ 


, Y = 


w 

f 7 \ 


VJ 


represent the same vector 


A^ = 

3- (-1) 


4 


7-3 


4-2 


2 


9-7 


In fact, if S = 


C 1+n \ 


, T = 


v 2+ - 

of the vectors AE> = XT' = sf. 


7 


( \ 

3 + n 


\4 + my 


= XY. 


then the infinite number of bi-points [S , T] are representatives of 


Given two vectors it, 7 we define their sum it +7 as follows. Find a bi-point representative AB e it 
and a contiguous bi-point representative BC? E 7. Then by Chasles’ Rule 

it+Tt=A^ + B^ = A^. 

Again, by virtue of Chasles’ Rule we then have 

Af> = A(3 + O^ = ~oX + O^ = T> -~ti 


(8.3) 
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Similarly we define scalar multiplication of a vector by scaling one of its bi-point representatives. We 
define the norm of a vector At 6 R 2 to be the norm of any of its bi-point representatives. 


Componentwise we may see that given vectors At = 
sum and scalar multiplication take the form 

At+At = 


U! 

U 2 


, At = 


Vi 

v 2 


and a scalar A £ R then their 


Ul 


V-l 

II 

Aui 


+ 



u 2 


v 2 


Au 2 



Figure 8.8: Addition of Vectors. 


Figure 8.9: Scalar multiplication of vectors. 


344 Example Diagonals are drawn in a rectangle ABCD. If All = 5? and A(i = A/, then fid = it — At, 
CE) = —it, DA =lt — l/> and BE) = l| — 2xP 

345 Definition (Parallel Vectors) Two vectors it and v are said to be parallel if there is a scalar A such 
that it = Ait. If Ti is parallel to At we write it | pit. We denote by Mit = {ait ; a 6 M), the set of all vectors 
parallel to At. 

0 is parallel to every vector. 


346 Definition If it 


ui 

u 2 


, then we define its norm as | |At 1 1 = y u 2 + u| . The distance between two 
vectors At and At is d(it,At) = 1 1 it — At 1 1 . 

347 Example Let a E R, a > 0 and let At =/"(?. Find a vector with norm a and parallel to At. 


At 

Solution: ► Observe that has norm 1 as 


VT 


At 

W\ 


||At|| 

M 


= i. 


At .At 

Hence the vector a , . has norm a and it is in the direction ofv. One may also take — a-pp^r 

v v 
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348 Example If M is the midpoint of the bi-point [X, Y] then XM = MY from where XM = jXY. Moreover, 
if T is any point, by Chasles’ Rule 


TX + TY 


T)\t + mX + T)vt + mX 
2TM - XM + MY 

2fhl. 


349 Example Let AABC be a triangle on the plane. Prove that the line joining the midpoints of two sides 
of the triangle is parallel to the third side and measures half its length. 

Solution: ► Let the midpoints of [A, B] and [A, C] be Me and Mb, respectively. We shew that 
Bt! = 2McM|{. We have 2AMc = AE> and 2AM b = ACL Thus 

B^ = bX + A^ 

= — Af> + A& 

= -2AMc + 2AMb 
= 2M C A + 2AMb 
= 2(M C X + AMb) 

= 2 M c M b , 


as we wanted to shew. ◄ 

350 Example In AABC, let Me be the midpoint of side AB. Shew that 

cmc = j (cX + c S) . 

Solution: ► Since AMc = Mc^, we have 

CA ~h C li = CMc IVLcA + CMc M ( ■ B 
= 2CMc — AMc + Mc^ 

= 2CMc, 

which yields the desired result. ◄ 

351 Theorem (Section Formula) Let APB be a straight line and A and p be real numbers such that 

ll[A, P]|| A 
l|[P,B]|| p' 

WithYi = oX, X = OT>, andfl = Ol^, then 

v Al? + pl2 
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Proof: Using Chasles’ Rule for vectors, 

A^ = A 6 + 0^ = ^+^, 

= a 6 + = -it +^. 

Also, using Chasles’ Rule for bi-points, 

[A,P]p = A([P, B]) = A([P,A] + [A,B]) = A(-[A,P] + [A,B]), 

whence 

[A, P] = —— — [A, B] =$ A^ = — ^ — AE> =* = — (t --tf). 

A + p A + p A + p 

On combining these formulae 

(A + p)(V — 1 ?) = A(l) — 1 ?) =^> (A + p)V = Al? + pli, 

from where the result follows. □ 



Figure 8.13: [D]. Problem 

8 . 2 . 6 . 


Figure 8.14: [E] . Problem 

8 . 2 . 6 . 


Figure 8.15: [F] . Problem 

8 . 2 . 6 . 


Homework 


Problem 8.2.1 Let a be a real number. Find the dis- 



1 


1 - a 

tance between 

a 

and 

1 


where v = 


1 

-1 


Problem 8.2.2 Find all scalars A for which ||Av?|| = 


Problem 8.2.3 Given a pentagon ABCDE, find All + 

+ CE> + Lit + EX. 


1 _ 

2 ’ 
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Problem 8.2.4 For which values of a will the vectors 


a + 1 

— > 

2a + 5 

a 2 — 1 

, b = 

a 2 — 4a -)- 3 


will be parallel? 

Problem 8.2.5 In A ABC let the midpoints of [A, B] and 
[A, C] be Me and Mb, respectively. Put McB = 5c , 
Mb C = l/ , and cX = X . Express [A] All + BC? + Me Mb , 
[B] AMc + M c Mb + M B C, [C] At + Me A - BMb in 
terms of x? , y\ and 7. 

Problem 8.2.6 A circle is divided into three, four equal, 
or six equal parts (figures 8.10 through 8.15). Find the 
sum of the vectors. Assume that the divisions start or 
stop at the centre of the circle, as suggested in the fig- 
ures. 

Problem 8.2.7 Diagonals are drawn in a square (figures 
?? through ??). Find the vectorial sum cl -\- b -|- 7. 
Assume that the diagonals either start, stop, or pass 
through the centre of the square, as suggested by the 
figures. 


Problem 8.2.8 Prove that the mid-points of the sides of 
a skew quadrilateral form the vertices of a parallelogram. 


Problem 8.2.9 ABCD is a parallelogram. E is the mid- 
point of [B, C] and F is the midpoint of [D, C]. Prove that 

AC^ + BE) = 2B(1 

Problem 8.2.10 Let A,B be two points on the plane. 
Construct two points I and J such that 

fX = -3I^, jX = — 1 fg, 

and then demonstrate that for any arbitrary point M on 
the plane 

MA + 3 Mil = 4MI 

and 

3MA + MB = 4MJ. 

Problem 8.2.11 You find an ancient treasure map in 
your great-grandfather’s sea-chest. The sketch indicates 
that from the gallows you should walk to the oak tree, 
turn right 90° and walk a like distance, putting and x at 
the point where you stop; then go back to the gallows, 
walk to the pine tree, turn left 90°, walk the same dis- 
tance, mark point Y. Then you will find the treasure at 
the midpoint of the segment XY. So you charter a sailing 
vessel and go to the remote south-seas island. On ar- 
rival, you readily locate the oak and pine trees, but un- 
fortunately, the gallows was struck by lightning, burned 
to dust and dispersed to the winds. No trace of it re- 
mains. What do you do? 


8.3 Dot Product in M 2 

352 Definition Let E (R 2 ) 2 . The dot product^ ofli andlt is defined by 



ai 

• 

bi 


a 2 


b2 


ciibi + azh>2- 


The following properties of the dot product are easy to deduce from the definition. 

DPI Bilinearity 

(X +X)*X =X»X +X*X, X»X +X) =XX +X.X 

(8.5) 

DP2 Scalar Homogeneity 

(aX)X = X • ( at)* ) = a(X»X), ct E R. 

(8.6) 

DP3 Commutativity 


(8.7) 

DP4 

X»X > o 

(8.8) 

DP5 

X»X = o <4-X = "o 

(8.9) 


Dot Product in R 2 


DP6 

||x|| = \/?C OC 


353 Example If we put 


— » 

1 

— > 

0 

i = 

0 

, ) = 

1 


then we can write any vector cf = 


0-1 

a 2 


as a sum 


— 4 + 

Ct = 0-1 t + 0 . 2 ). 


The vectors 


satisfy = 0, and 




— > 

1 

— > 

0 

i = 

0 

, i = 

i 


354 Definition Given vectors cf and b , we define the angle between them, denoted by ( c? , b) , 
between any two contiguous bi-point representatives of and b . 


355 Theorem 


s^«b = ir^Hllb || cos (l£, b ). 


Proof: Using Al-Kashi’s Law of Cosines on the length of the vectors, we have 

ift> -^ii 2 = ir^n 2 + irt^ii 2 - 2ir<iiiiii) ii cos ni,i>) 

<4 (~t> -^) = ll^ll 2 + lft>ll 2 — 2 |r« 2 ||HT>||cos 

<=> 17*11 — 2 lt*l 7 + 7 t* 7 t = II^H 2 + |ft >|| 2 — 2 |[ci|||ft>|| cos 
<4 | 1 j || 2 - 2 a*t> + ||t >|| 2 = ir^ll 2 + |fb || 2 — 2 |fci|||fb || cos ( 1 ?, t>) 
<=>~tl*i7 = irc?iiift>iicos 
as we wanted to shew. □ 

Putting (ll,!)) — ^ in Theorem 355 we obtain the following corollary. 


356 Corollary Two vectors in R 2 are perpendicular if and only if their dot product is 0. 
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Figure 8.19: Theorem 355. 


357 Definition Two vectors are said to be orthogonal if they are perpendicular. If li is orthogonal t o b , we 
write ^ _L b. 


--4 


1 1 — ^ 1 1 

-4 

a. = 

b 


. -4 


It follows that the vector is simultaneously parallel and perpendicular to any vector! 


v ~ . 

359 Definition Let a 6 r be fixed. Then the orthogonal space to a is defined and denoted by 

~cf' L = (x e K 2 : ~x 1 




Since | cos 0 1 < 1 we also have 

360 Corollary (Cauchy-Bunyakovsky-Schwarz Inequality) 

^ < H^l 


361 Corollary (Triangle Inequality) 


Proof: 


ll + 1) < I Hi 


ir^ +^ii 2 = +^).(^+^) 

= + l> •!> 

< ir^ii 2 + 2ii^initii + iiti! 2 

= (ir^ii + ir^ii) 2 , 

from where the desired result follows. □ 

362 Corollary (Pythagorean Theorem) If ^ _L 1? then 

itf +i?ii 2 = ir^ll 2 + t> 


Proof: Since cl • b = 0, we have 


= + 1 ? ) . cd + ; 

= + 1? •!) 
= + o + •!> 


= a 


ir^ +^n 2 
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from where the desired result follows. □ 

363 Definition The projection of ~t onto V (or the ^-component of t ) is the vector 


proj4 = (cos ( t 


1 


FIT*’ 


where (v , t ) E [0; 7t] is the convex angle between V 1 and t' read in the positive sense. 


Given two vectors 7 and vector T? f ~0 , find bi-point representatives of them having a 
common tail and join them together at their tails. The projection of t onto V is the “shadow” of 
7 in the direction of^ . To obtain proj f we prolong^ if necessary and drop a perpendicular line 
to it from the head of t . The projection is the jportion between the common tails of the vectors 
and the point where this perpendicular meets t . See figure 8.20. 




Figure 8.20: Vector Projections. 


364 Corollary Let^ y'Tb Then 


projl 


cos 


(bc^lihrxh 


pn 



365 Theorem Let ll E R 2 \ { 0 }. Then any ^gR 2 can be decomposed as 


where E R cZ and V* E ct^. 


— ) 

X 




— ) 

v, 


Proof: We know that proj ^ is parallel to cZ , so we lake rf = proj . This means that we must 

then take 7 - x 2 proj * . We must demonstrate that 7 is indeed perpendicular to 7 . But this 
is clear, as 


a*v = a*x 


= 


7*proj | 


R.a - 


m 2 


= a*x — x»a 


= 0, 


completing the proof. □ 


366 Corollary Let V li^be non-zero vectors in R 2 . Then any vector iZ E R 2 has a unique representation 
as a linear combination of 7’, vt> , 

"7 = s~7 + tti, (s, t) E R 2 . 
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Proof: By Theorem 365, there exists a decomposition 

= s^ + s'^ / , 

inhered' is orthogonal to V . But thenf}'\\x& and hence there exists agl with T?' = ah’. Taking 
t = s ' a we achieve the decomposition 

if = slf + tv^. 


To prove uniqueness, assume 

sJ + tr^ = if = plf + quf 

Then (s - p)V = (q - t)v$. We must have s = p and q = t since otherwise T? would be parallel 
to This completes the proof . □ 

367 Corollary Let p' , q be non-zero, non-parallel vectors in M 2 . Then any vector ci 6 R. 2 has a unique 
representation as a linear combination of^jf ,lf , 

if = Ijf + mlf , (l,m) gl 2 . 


Proof: Consider ~2 = if proj | . Clearly jf _L and so by Corollary 366, there exists unique 
(s, t) 6 R 2 such that 

if = + t2 

- sp - tproj fj + t cf 

- 

establishing the result upon choosing l = s — t-lLjl and m = t. □ 


t = 


Write V as the sum of two vectors, one parallel to if and the other 


368 Example Let V = 

perpendicular to if . 

Solution: ► We use Theorem 365. We know that proj 'j is parallel to if , and we find 


P r °J^ 


.f -4 3 




We also compute 


? proj 


1 

3 


2 

5 

_ 

5 

1 

6 


1 

5 


5 


6 6 

= 25 _ 25 = ’ 


Observe that 
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and the desired decomposition is 


1 


3 


2 

= 

5 

+ 

5 


1 


6 


1 



5 


5 



Figure 8.21: Orthocentre. 


369 Example Prove that the altitudes of a triangle A ABC on the plane are concurrent. This point is 
called the orthocentre of the triangle. 


Solution: ► Put. cf = oX, b = Oe£,7? = OC. First observe that for any 7, we have, upon 
expanding, 

(3? — “c£)«(T> — "7) + (t? — )•("<? — it) + (~7 — — t>) = 0. (8.11) 

Let H be the point of intersection of the altitude from A and the altitude from B. Then 

o = at1.c^ = (oil -oX). (oil-ocb = (oii ( 8 . 12 ) 


and 


0 = BH 


.A^ = 


OH- OB).( 


.(0(X 


OA) = (OH - b).f2 -~ct) 


Putting = Ol4 in (8.11) and subtracting from it (8.12) and (8.13), we gather that 


(8.13) 


o = (opi -^).f3 -1?) = crt.A^, 


which gives the result. ◄ 


Homework 


Problem 8.3. 1 Determine the value of a so that 


a 

— a 


be perpendicular to 


i 

-1 


Problem 8.3.2 Demonstrate that 

(tj+i? =~(?)A(||"cf|| = ||i>||) r^-t).(^-^) = 0. 


I? as the sum of two vectors, one parallel to 7 and the 
other parallel to . 


Problem 8.3.4 Prove that 



Problem 8.3.5 Lei cl f 0 fb be vectors in R 2 such that 
h.b =0. Prove that 

ac( + (3lj = 7 a = p = 0. 


Problem 8.3.3 Let f = 

4 

,T = 

-1 

, 1? = 

2 

. Write 


5 


1 


1 










Problem 8.3.6 Let (7, TJ' ) g (R 2 ) 2 with ||7|| = ||f^ 
Shew that 2 7 + 3T( and 2 7 — 3T( are perpendicular. 
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Problem 8.3.7 Let ll, b be fixed vectors in R 2 . Prove 
that if 

V\t £ R 2 , 8? f .1) , 

then li = b . 

Problem 8.3.8 Let £ (R 2 ) 2 . Prove that 

||^ + t|| 2 + ||li-t|| 2 =2||^|| 2 +2||t|| 2 . 

Problem 8.3.9 Let it, 8} be vectors in R 2 . Prove the po- 
larisation identity: 

if .1? = 1 (ifrf +^|[ 2 _ |prf -4?|| 2 ) . 


Problem 8.3.10 Let be non-zero vectors in R 2 . 

Prove that 

P r °j| . 

proj = oCa, 

with 0 < a < 1 . 

Problem 8.3.11 Let (A,l2) £ R X R 2 be fixed. Solve the 
equation 

-&•!? = A 

for ^ £ R 2 . 


8.4 Lines on the Plane 

370 Definition Three points A, B, and C are collinear if they lie on the same line. 


It is clear that the points A, B, and C are collinear if and only if AE> is parallel to A(i Thus we have the 
following definition. 

371 Definition The parametric equation with parameter t £ M of the straight line passing through the 


point P = 


u 


in the direction of the vector is 


\P2j 


X — pi 

y -P2 


tJ. 


itl* = 


then the equation of the line can be written in the form 


= (8.14) 

The Cartesian equation of a line is an equation of the form ax + by = c, where a 2 + b 2 f 0. We write 
(AB) for the line passing through the points A and B. 

372 Theorem Let. f and let. n' T V. An alternative form for the equation of the line f — ~p = tV is 

C? = 0. 


Moreover, the vector 



b 


is perpendicular to the line with Cartesian equation ax + by = c. 


Proof: The first part follows at once by observing that V'.n = 0 and taking dot products to both 

sides of 8.14. For the second part observe that at least one of a and b is f 0. First assume that 
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a f 0. Then we can put y = t and x = — ^ and the parametric equation of this line is 


X ~ a 


= t 


and we have 


b 



a 


a 

• 


1 


b 


a + b = 0. 


Similarly if b f 0 we can putx = t and y = - + £ and the parametric equation of this line is 




■ 

X 


1 


= t 


^ . c 


a 

y b 


b 


and we have 


= a — — • b = 0, 
b 


proving the theorem in this case. □ 
The vector 


Va z + b* 
b 

V a 2 +b 2 


has norm 1 and is orthogonal to the line ax + by = c. 


373 Example The equation of the line passing through A = 




vJ 


and in the direction of ~v = 


-4 

5 


is 


x — 2 
y -3 


= A 


374 Example Find the equation of the line passing through A = 


H 

and B = 

H 

1 J 


l 3 / 


Solution: ► The direction of this line is that of 


Aii = 

-2- (-1) 


-1 


3 - 1 


2 


The equation is thus 


x + 1 


-1 


= A 


y-i 


2 


, A £ R. 
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◄ 

375 Example Suppose that (m, b) E R 2 . Write the Cartesian equation of the line y = mx+b in parametric 
form. 


Solution: ► Here is a way. Putx = t. Then y = mt + b and so the desired parametric form is 


■ 


" 

X 

= t 

1 

1 

rD 

1 

1 


m 


◄ 

376 Example Let (mi , m2, bi , b2) € R 4 ,mim.2 f 0 . Consider the lines Li : y = mix + bi and L2 : y = 
m.2X + b2- By translating this problem in the language of vectors in R 2 , shew that Li _L L2 if and only if 
mi m 2 = —1 - 


Solution: ► The parametric equations of the lines are 



X 


1 


X 


i 

Li : 

y — bi 

= s 

mi 

, L 2 : 

y — b 2 

= t 

m 2 


Put^rf = 

1 

andit = 

1 


mi 


m 2 


Since the lines are perpendicular we must have V.w = 0 , which 


yields 


0 =V»v^ = l(l) + mi (m2) 


mi m 2 = —1 . 


◄ 


377 Theorem (Distance Between a Point and a Line) Let (~r — it) .it = 0 be a line passing through the 
point A and perpendicular to vector n. If B is not a point on the line, then the distance from B to the 
line is 


(it — l> ) « rt 


n 


If the line has Cartesian equation ax + by = c, then this distance is 


abi + bb2 — c| 
Va 2 + b 2 


Proof: Let Ro be the point on the line that is nearest to B. Then BRo = ro — b is orthogonal to 

the line, and the distance we seek is 


[projfT b ! 


(fo 


l(fo 

-lt).rt| 


|rt| 

2 

1 

|h| 

1 


Since Ro is on the line, root = it.rt, and so 


proj 


ro— f> I 




b-TCl 


a .if 


b.rtl 


(it — b ) • rt I 


w 


IFI 


M 


as we wanted to shew. 
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If the line has Cartesian equation ax + by = c, then at least one of a and b is y 0 . Let us suppose 
a /O, as the argument when a = 0 and b y 0 is similar. Then ax + by = c is equivalent to 


/ 



c 





A- 



a 


• 


V 

y 


0 

) 


b 


We use the result obtained above with c? 


V a 2 + b 2 and 


_C 

a 

, TX = 

a 

, and B = 

H 

. Then it 

0 


b 





\(~d 




• 

a 


— b 2 


b 


\c 


abi 


bb 2 | = |ab! + bb 2 — c|, 


giving the result. □ 

378 Example Recall that the medians of AABC are lines joining the vertices of AABC with the midpoints 
of the side opposite the vertex. Prove that the medians of a triangle are concurrent, that is, that they 
pass through a common point. 

This point of concurrencg is called, alternativelg, the isobarycentre, centroid, or centre of 
gravity of the triangle. 

Let Ma, Mb, and Me denote the midpoints of the lines opposite A, B. and C, 

The equation of the line passing through A and in the direction of AMa is (with 

y 

^r = OA + vAMa. 

Similarlg, the equation of the line passing through B and in the direction of BMb is 

“p 1 = Of> + sBMa- 

These two lines must intersect at a point G inside the triangle. We will shew that Chd is parallel 
to CMc, which means that the three points G, C, Me are collinear. 

Now, 3(ro, So) 6 R 2 such that 

OA + r oAMa - OGj — C 1 1 > + SoBAIb , 

that is 

roAMA — SoBMb = OB — OA, 
or 

ro(AE> + BMa) - s 0 (BA + AM B ) = AE>. 

Since Ma and Mb are the midpoints of[ B, C] and [C, A] respectivelg, we have 2BMa = BC^ and 
2AM b = A(! = AB + BC^. The relationship becomes 


Solution: ► 

respectivelg. 


= 


) 
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(r 0 + 


So 


-1)A^ = (- 


ro 

T 


so 

y 


Bel 


We must have 


r 0 


so 
2 

_ro So 

y + y 

since otherwise the vectors AB and Bd would be parallel, and the triangle would, be degenerate. 

Thus we have OA 


- 1 = 0 , 


= 0, 


Solving, we find So = r 0 = #. 
bX = §BMb. 


|AMa = OG, or AG = f AM a , and similarly, 


-4 


From Ad = |AM a , we deduce Ad = 2GM A . Since Ma is the midpoint o/[B, C], we have 
cTBl + Gd = 2 GM a = Ad, which is equivalent to 

gX + g^ + Gd =d. 

As Me is the midpoint of [A, B] we have GA + (iTlI = 2GMc. Thus 

d = GX + Gd + Gd = 2GMc + Gd. 

This means that Gd = 2GM~c, that is, that they are parallel, and so the points G, C and Me 
all lie on the same line. This achieves the desired result. ◄ 

The centroid of A ABC satisfies thus 

gX + g^ + Gd =d, 

and divides the medians on the ratio 2:1, reckoning from a vertex. 


Homework 


Problem 8.4.1 Find the angle between the lines 2x—y = 
1 and x — 3y = 1 . 


Problem 8.4.2 Find the equation of the line passing 



, ) 


2 

through 


and in a direction perpendicular to 



H 


1 


Problem 8.4.3 A ABC has centroid G, and AA'B'C' 
satisfies 

AA 1 + BB 1 + CC' =d. 

Prove that G is also the centroid of AA'B'C'. 

Problem 8.4.4 Let ABCD be a trapezoid, with bases 
[A, B] and [C, D]. The lines (AC) and (BD) meet at E and 
the lines (AD) and (BC) meet at E. Prove that the line 
(EE) passes through the midpoints of [A, B] and [C, D] by 
proving the following steps. 

O Let I be the midpoint of [A, B] and let J be the point 
of intersection of the lines (El) and (DC). Prove 
that J is the midpoint of [C, D], Deduce that E, I, J 
are collinear. 


© Prove that E , I , J are collinear. 

Problem 8.4.5 Let ABCD be a parallelogram. 

O Let E and E be points such that 

At = lXd and aI = |aX. 

4 4 

Demonstrate that the lines (BE) and (DE) are par- 
allel. 

© Let I be the midpoint of [A, D] and J be the mid- 
point of [B, C]. Demonstrate that the lines (AB) 
and (I J) are parallel. What type of quadrilateral is 

IEJE? 

Problem 8.4.6 ABCD is a parallelogram; point I is the 
midpoint of [A, B]. Point E is defined by the relation 
Hi = fib. Prove that 

a! = j (ab + ad) 

and prove that the points A, C, E are collinear. 
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Problem 8.4.7 Put OA = cl, Oil = b, OC = f. Prove 
that A, B, C are collinear if and only if there exist real 
numbers a, (3, y, not all zero, such that 

all + pli + y~c? =~S , a+(3+y = 0. 

8.5 Vectors in M 3 


Problem 8.4.8 Prove Desargues’ Theorem: If A ABC 

and AA'B'C' (not necessarily in the same plane) are 
so positioned that (AA'), (BB'), (CC') all pass through 
the same point V and if (BC) and (B'C') meet at L, (CA) 
and (C'A') meet at M, and (AB) and (A'B') meet at N, 
then L, M, N are collinear. 


We now extend the notions studied for R 2 to K 3 . The rectangular coordinate form of a vector in K 3 is 


ai 


~d = 


a 2 ■ 


a 3 


In particular, if 


— » 

i 


1 


0 


0 


— > 




0 

, j = 

1 

, = 

0 

0 


0 


1 


then we can write any vector fj = 


ai 

a 2 

a 3 


as a sum 


a = ai i 


a 2 ) 


a 3 k. 



ai 


hi 

Given a = 

a 2 

andl? = 

b 2 


a 3 


b 3 


their dot product is 


^«b = aibi + a 2 b 2 + a 3 b 3 , 


and 


We also have 


|f(i|| = yj af + a| + a 2 . 

-R rr> yt -A „ 

t.j = j.k=K*i =0, 


and 




= 1. 


379 Definition A system of unit vectors i , j , k is right-handed if the shortest-route rotation which brings 
i to coincide with j is performed in a counter-clockwise manner. It is left-handed if the rotation is done 
in a clockwise manner. 
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To study points in space we must first agree on the orientation that we will give our coordinate system. 
We will use, unless otherwise noted, a right-handed orientation, as in figure 8.22. 




Figure 8.22: Right-handed system. 


Figure 8.23: Left-handed system. 


The analogues of the Cauchy-Bunyakovsky-Schwarz and the Triangle Inequality also hold 
in R 3 . 


We now define the (standard) cross (wedge) product in R 3 as a product satisfying the following prop- 
erties. 

380 Definition Let a) 6 R 3 x R 3 x R 3 x R. The wedge product x : R 3 x R 3 —» R 3 is a closed 

binary operation satisfying 

CPI Anti-commutativity: 

^xi/ = -(Vx"?) (8.15) 

CP2 Bilinearity: 

(~x^ +l?)xl| = 5^xyf +l?xl/, =l?x^+^xT? (8.16) 

CP3 Scalar homogeneity: 

(o^)xV =^x(a^) = a(^x^) (8.17) 

CP4 

5?x5? ="6 (8.18) 


CP5 Right-hand Rule: 


ix j = k, j xk = i, kx i = j 


(8.19) 



Xl 


yi 

381 Theorem Let 5? = 

X 2 

andl/ = 

V2 


X 3 


y 3 


be vectors in R 3 . Then 


5?xi/ = (x 2 y 3 — x 3 y 2 ) i + (x 3 yi — xiy 3 ) j + (xiy 2 — x 2 y!)l<- 
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— ) — ) — ) — ^ ^ — ) — ) 

Proof: Since i x i = j x j = kxk = 0 we have 


— > 


(*i i +x 2 j +X 3 k)x(y! i + y 2 ) + y 3 k) = *iy 2 i x j + X!y 3 i x k 

+x 2 yi J x i +x 2 y 3 j xk 

~r^ ~T> ~r> ~r> 

+x 3 y i kxi + x 3 y 2 k x j 
V V 

= xiy 2 k — xiy 3 ) — x 2 yi k 

—> —> —> 
+x 2 y 3 i + x 3 yi j — x 3 y 2 i 

from where the theorem follows. □ 

382 Example Find 


1 


0 

0 

X 

1 

-3 


2 


Solution: ► We have 




Hence 


-4 — > 


i— 3k)x(j + 2k) = ixj + 2i xk - 3kxj — 6kxk 
= 1< — It — 3~f + 6$ 

= —3^ — It + t. 


1 


0 


-3 

0 

X 

1 

= 

-2 

-3 


2 


1 


383 Theorem The cross product vector Si y,fj is simultaneously perpendicular to x’ and yi . 
Proof: We will only check the first assertion, the second verification is analogous. 

3?.(3?xyh = (x,t + x 2 t + x 3 t).((x 2 y 3 -x 3 y 2 )t 

+ (x 3 yi — xiy 3 )Y + (xiy 2 — x 2 y!)lc) 

= xix 2 y 3 — X!X 3 y 2 +x 2 x 3 y! — x 2 xiy 3 + x 3 xiy 2 — x 3 x 2 y! 

= 0, 
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completing the proof. □ 
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384 Theorem l2x(~t>xl?) = 


Proof: 


-4 


ax (b x c ) = (ai i + a2 j + a3 k) x ((b2C3 — b3C2) i 




+ (b 3 ci — bi c 3 ) j + (bic 2 — b 2 c 1 )k; 
a^ (b 3 C! — b! c 3 )t - ai (bi c 2 - b 2 C] f) 

— a 2 (b 2 c 3 — b 3 c 2 )lc + a 2 (bi c 2 - b 2 c 1 fr 


— » 


+a 3 (b 2 c 3 — b 3 c 2 ) j — a 3 (b 3 Ci — bic 3 
[a! ci + a 2 c 2 + a 3 c 3 )(bo + b 2 ^ + b 3 ^) 




completing the proof. □ 

385 Theorem (Jacobi’s Identity) 


+ (— aibi - a 2 b 2 - a3b3)(c! i + c 2 j 




^x(tx^) + l?x("2x~rf) + 1? x (l2 xl> ) = 


— > 


-4 -4 


C 3 l 


Proof: From Theorem 384 we have 

^x(Vx^) = (bl •!?)}} — (l?*!))!?, 

ljX(~C^X~<^) = 

and adding yields the result. □ 

386 Theorem Let (~x 6 [0; 7t[ be the convex angle between two vectors ~x and y\ Then 


Proof: We have 

ll^x^ll 2 = (X2IJ3 — X3IJ2) 2 + (x 3 yi — XTy 3 ) 2 + (xn)2 — x 2 yi ) 2 
= xfy| — 2 x 2 y 3 X 3 y 2 +X 2 y2 + xfy 2 — 2 x 3 y 1 x 1 y 3 + 

+x 2 y| + x 2 y^ — 2 xiy 2 x 2 yi + xjyf 
= (xf +x| +xf)(yf + yj + y §) — (x^i +x 2 y2 +x 3 y 3) 2 

= l^ll 2 llV!l 2 -(^-V ) 2 
= l^ll 2 llV[l 2 - ll^ll 2 liy II 2 cos 2 
= ||'x|| 2 || _ ytl 2 sin 2 (Ic.^y ), 
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whence the theorem follows. The Theorem is illustrated in Figure 8.24. Geometrically it means 
that the area of the parallelogram generated by joining x and y' at their heads is | |~x xy 1 1 . □ 



Figure 8.24: Theorem 386. 

The following corollaries are now obvious. 

387 Corollary Two non-zero vectors 3?, f satisfy ~x? xfj = O if and only if they are parallel. 

388 Corollary (Lagrange’s Identity) 

IlcxVl ! 2 = ||x!! 2 ||y !! 2 - (^.y^ 2 . 

389 Example Let 3? e R 3 , ||x|| = 1. Find 

ll^x^ll 2 + ll^xtll 2 + l^xtll 2 . 


Solution: ► By Lagrange’s Identity, 

||3c x i || 2 = | |3c 1 1 2 
|pxf x^ll 2 = | |T^| | 2 
ll^xYll 2 = ll^ll 2 


— (x ) 2 = 1 — fsU i 


— k 


- (^*?) 2 = i - (^- T ) 2 , 


— = 1 — (~xM<) 2 , 


-4, 


and since ) 2 + ) 2 + = | 3 ?| | 2 - 1 , the desired sum equals 3 - 1=2.-* 


Problem 8.5.1 Consider a tetrahedron ABCS. [A] Find 
All + BC? + CJ>. [B] Find AC^ + C5> + sX + All. 


Problem 8.5.2 Find a vector simultaneously perpendic- 



1 


1 

ular to 

1 

and 

1 


1 


0 


and having norm 3. 


Problem 8.5.3 Find the area of the triangle whose ver- 


tices are at P = 


M (o' 

0 


. Q 


, and R = 


W \°J 




0 

\°J 


Problem 8.5.4 Prove or disprove! The cross product is 
associative. 

Problem 8.5.5 Prove that T* xT* = 0 follows from the 
anti-commutativity of the cross product. 

Problem 8.5.6 Expand the product (l2 — ) X (~c2 +1) ). 


Problem 8.5.7 The vectors^d, b^arc constant vectors. 
Solve the equation Tixfx^xb) = b x (3? X f ) . 


Problem 8.5.8 The vectors a,b, c are constant vec- 
tors. Solve the system of equations 

2x? + t/xli =1?, 37/ -t-~x*xii =~£, 


174 


Chapter 8 


Problem 8.5.9 Prove that there do not exist three unit 
vectors In R 3 such that the angle between any two of 

them be > ^ . 

Problem 8.5.10 Let £ l 3 be a fixed vector. Demon- 
strate that 

X = e R 3 = ~3} 

Is a subspace of R 3 . 

Problem 8.5.11 Let (tJ, b" ) G (R 3 ) 2 and assume that 
a «b =0 and that hi and b are linearly independent. 
Prove that ^,b,^xb are linearly independent. 


Problem 8.5.12 Let (^,b) £ R 3 x R 3 be fixed. Solve 
the equation 

=1j, 

for hi. 

Problem 8.5.13 Let. h , k be fixed vectors in R 3 . Prove 
that 

R 3 x R 3 -> R 3 

L : 

(7?,l/) 7?xl< + tixl? 

is a linear transformation. 


8.6 Planes and Lines in M 3 

390 Definition If bi-point representatives of a family of vectors in R 3 lie on the same plane, we will say 
that the vectors are coplanar or parallel to the plane. 

391 Lemma Let it, in R 3 be non-parallel vectors. Then every vector it of the form 

it = alt + hv^, 

((a, b) 6 R 2 arbitrary) is coplanar with both it and v^. Conversely, any vector t coplanar with both it 
and vl can be uniquely expressed in the form 

= pit + q-vt'. 


Proof: This follows at once from Corollary 367, since the operations occur on a plane, which 

can be identified with R 2 . □ 


A plane is determined by three non-collinear points. Suppose that A, B, and C are non-collinear 

/ \ 

x 


points on the same plane and that R = 


is another arbitrary point on this plane. Since A, B, and C 


w 

are non-collinear, AE> and A(l, which are coplanar, are non-parallel. Since Alt also lies on the plane, we 
have by Lemma 39 1 , that there exist real numbers p , q with 


AR = pAB + qAC. 


By Chasles’ Rule, 


OR = OA + p(OB — OA) + q(OC-OA), 


is the equation of a plane containing the three non-collinear points A, B, and C. By letting T 1 = Olt, 
ct = OA, etc. , we deduce that 

7 - ci = p(b — li ) + q (1? — li ) . 


Thus we have the following definition. 
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392 Definition The parametric equation of a plane containing the point A, and parallel to the vectors it 
and V is given by 

~r* — = pit + qT? 1 . 

Componentwise this takes the form 


x cii = pur + qv! , 

y — a 2 = pu 2 + qv 2 , 
z — a 3 = pu 3 + qv 3 . 


The Cartesian equation of a plane is an equation of the form ax + by + cz = d with (a, b, c, d) gl 4 and 
a 2 + b 2 + c 2 f 0. 

393 Example Find both the parametric equation and the Cartesian equation of the plane parallel to the 







/ \ 


1 


1 


0 

vectors 

1 

and 

1 

and passing through the point 

-1 


1 


0 


l 2 / 


Solution: ► The desired parametric equation is 


X 


1 


1 

y + i 

= s 

1 

+ 1 

1 

z — 2 


1 


0 


This gives s = z — 2 , t = y + 1 — s = y + 1 — z + 2 = y— z + 3 andx = s + t = z — 2 + y— z + 3 = y+1. 
Hence the Cartesian equation is x — y = 1 . ◄ 

394 Theorem Letdi and v be non-parallel vectors and let 7 7? = pit + q v be the equation of the plane 

containing A an parallel to the vectors it and 7. If h' is simultaneously perpendicular to Ti and v’ then 

(7 — t = 0. 


Moreover, the vector 


is normal to the plane with Cartesian equation ax + by + cz = d. 


Proof: The first part is clear, as Tint = 0 = For the second part, recall that at least one 

of a, b, c is non-zero. Let us assume a =f 0. The argument is similar if one of the other letters is 
non-zero and a = 0. In this case we can see that 



c 

— z. 
a 
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Put y = s and z = t. Then 


x_ d 
a 


_b 


c_ 


a 


a 

V 

= s 

1 

+ t 

0 

z 


0 


i 


is a parametric equation for the plane. □ 

395 Example Find once again, by appealing to Theorem 394, the Cartesian equation of the plane parallel 



1 


1 


M 

to the vectors 

1 

and 

1 

and passing through the point 

-i 


1 


0 


l 2 / 



1 


1 


-1 

Solution: ► The vector 

1 

X 

1 

= 

1 


1 


0 


0 


is normal to the plane. The plane has thus equation 


X 


-1 

y + i 

• 

1 

z — 2 


0 


as obtained before. ◄ 


— x + y + 1 =0 


x-y = 1, 


396 Theorem (Distance Between a Point 

point A and perpendicular to vector H. 
plane is 


and a Plane) Let (1* — Tf ) - r£ = 0 be a plane passing through the 
If B is not a point on the plane, then the distance from B to the 

(li — "b ) «h 


— > 


Proof: Let Ro be the point on the plane that is nearest to B. Then BR 0 = ro — b is orthogonal 

to the plane, and the distance we seek is 


P 

[projfT b | 


(i=o 


l(fo 

- b 

Wl 


|rt| 

2 


1^1 

1 


Since Ro is on the plane, roll? = If •if, and so 


Iprojf 1 ^! 


fl?.r^ — bW| |fc£— 


-4 


■pm 


pm 


PIT 


as we wanted to shew. □ 

Given three planes in space, they may (i) be parallel (which allows for some of them to 
coincide), (ii) two may be parallel and the third intersect each of the other two at a line, (iii) 
intersect at a line, (iv) intersect at a point. 


177 


Planes and Lines in R 3 


\ — 7 

397 Definition The equation of a line passing through A G R in the direction of v f 0 is given by 

T> - li = tV, t 6 R. 

398 Theorem Put OA = cf, OT> = "b , and od = ~S. Points (A,B, C) G (R 3 ) 3 are collinear if and only if 

~dxt> +l?x“^ = 0. 


Proof: If the points A, B, C are collinear, then AB is parallel to AC and by Corollary 387, we 

must have 

(~d — it) x (b — it) = 0. 


Rearranging, gives 


\ — 1 \ \ 

c x b — c x a 


ax b = 0 . 


Further rearranging completes the proof. □ 


399 Theorem (Distance Between a Point and a Line) Let L : ~r = it + Alt, V ^O.bea line and let B be a 

point not on L. Then the distance from B to L is given by 

ii(it 

iwn ■ 


Proof: If R 0 — with position vector fo — is the point on L that is at shortest distance from B then 

BR 0 is perpendicular to the line, and so 

IIBRoX^H = ||BRo||| |V| | sin ^ = ||BRo||||Tt||. 


The distance we must compute is 


BR r 


= ||fo — bd, which is then given by 


n -4 ||BR 0 xlt|| ||(fo - b)xlt|| 

" T0 ' b "‘^i pn— 

Now, since R 0 is on the line 3t 0 E R such that fo = it + to it. Hence 


(fo — b ) xlt = (it — b ) xv , 


giving 

i ^ _ ||f^ - b' 

if 

proving the theorem. □ 

Given two lines in space, one of the following three situations might arise: (i) the lines 
intersect at a point, (ii) the lines are parallel, (iii) the lines are skew (one over the other, without 
intersecting). 


II 1 o 



Homework 


Problem 8.6.1 Find the equation of the plane passing 
through the points (a, 0,a), (—a, 1,0), and (0,1,2a) in 

R 3 . 


Problem 8.6.2 Find the equation of plane containing 
the point (1,1,1) and perpendicular to the line x = 
1 + t, y = — 2t, z = 1 — t. 
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Problem 8.6.3 Find the equation of plane containing 
the point (1 , — 1 , — 1 ) and containing the line x = 2y = 3z. 

Problem 8.6.4 Find the equation of the plane perpen- 
dicular to the line ax = by = cz, abc =/ 0 and passing 
through the point (1,1,1) inK 3 . 

Problem 8.6.5 Find the equation of the line perpendic- 
ular to the plane ax + a 2 y + a 3 z = 0, a =/ 0 and passing 
through the point (0, 0, 1 ). 


X 


-1 


1 

y 

= 

-2 

+ t 

0 

z 


0 


1 


Problem 8.6.8 Points a b c in R 3 are collinear and it is 
known that li x~(? = i — 2 j and ^xb = 2 k — 3 i . Find 


Problem 8.6.6 The two planes 

x — y — 2 = 1, x — z = — 1, 

intersect at a line. Write the equation of this line in the 
form 

x 


Problem 8.6.9 Find the equation of the plane which is 


equidistant of the points 


2 

V 1 / 


and 


M 


v 1 / 


= a i tv, te 


Problem 8.6.10 (Putnam Exam, 1980) Let S be the 

solid in three-dimensional space consisting of all points 
(x, y , z) satisfying the following system of six conditions: 


Problem 8.6.7 Find the equation of the plane passing 


through the points 


^ 1 ^ (2) 

0 

v-v 


and parallel to the line 


w 


x > 0, y > 0, z > 0, 

x + y + z < 11, 

2x + 4y + 3z < 36, 

2x + 3z < 24. 

Determine the number of vertices and the number of 
edges of S. 


8.7 M n 

As a generalisation of R 2 and R 3 we define R n as the set of n-tuples 


' 


> 





x 2 

: Xt € R 


Xn 



The dot product of two vectors in R n is defined as 



Xl 


yi 


x 2 

• 

V2 


x n 


Dn 


= xiyi +x 2 y2 H Hx n y n . 



The norm of a vector in R n is given by 
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As in the case of IR 2 and IR 3 we have 

400 Theorem (Cauchy-Bunyakovsky-Schwarz Inequality) Given (Rc , y* ) g (R n ) 2 the following inequality 
holds 


n n n 

Proof: Put a = ^ x£, b = ^ x k y k , and c = ^ y^. Consider 

k=1 k-1 k-1 


f(t) = X(tXk - yk) 2 = t 2 X x k — 2t^x k y k + Y_ yl = at 2 + bt + c. 

k=l k-1 k=1 k-1 

This is a quadratic polynomial which is non-negative for all real t, so it must have complex roots. 
Its discriminant b 2 — 4ac must be non-positive, from where we gather 

4 (£**«*) 2 < 4 (tvl)- 

This gives 

< ||^|| 2 ||^|| 2 

from where we deduce the result. □ 

401 Example Assume that a k , b k , c k) k = 1 , . . . , n, are positive real numbers. Shew that 


Solution: ► Using CBS on £L k =i (a k b k )c k once we obtain 


1/2 


1/2 


L 


Vk=1 


Y_ a k b k c k < Y_ a k b k 

k-1 \k=1 / 

Using CBS again on (]T k=1 a k bj^) 1 2 we obtain 

a.aAc, < (n-,^t’a ,/2 (n.,4)' /2 

< (LR,o ,/ 4 (n.,h) 1 / 4 (n-, 4 ) 1/2 , 

which gives the required inequality. ◄ 

402 Theorem (Triangle Inequality) Given G (R n ) 2 the following inequality holds 

||^ + R/|| < 1 1^|| + ||R/||- 


Proof: We have 

IR? +^|| 2 = (c£ + 

= + 2R^.l> + 1>.T> 

< ir^ii 2 + 2ii^initii + iitii 2 
= (ii^n + iibin 2 , 
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from where the desired result follows. 

□ 

We now consider a generalisation of the Euclidean norm. Given p > 1 and we put 

i/p 

= I V IXvl P ) 


n?n P - iw 


Clearly 


\k=1 


x >0 

i lip — 


I px?| I = 0 <4 ~x? = S 

ii iip 

I ax? 1 1 = I all Pxh I , a£ 

i lip 1 'ii lip’ 


( 8 . 20 ) 

( 8 . 21 ) 

( 8 . 22 ) 

(8.23) 


We now prove analogues of the Cauchy-Bunyakovsky-Schwarz and the Triangle Inequality for ||-|| . For 
this we need the following lemma. 

1 1 

403 Lemma (Young’s Inequality) Let p > 1 and put — | — = 1 . Then for (a, b) 6 ([0; +oo[) 2 we have 

P q 

up b q 

ab < 1 . 

P q 

Proof: Let 0 < k < 1 , and consider the function 


f : 


[0; +oo[ 


(-> x k — k(x — 1 


Then 0 = f'(x) = kx k 1 — k <4 x = 1 . Since f"(x) = k(k — 1 )x k 2 < 0 for 0 < k < 1 ,x > 0, x = 1 

. 1 

is a maximum point. Hence f (x) < f(l)/orx > 0, that is x K < 1 + k(x — 1). Letting k = — and 

P 


a p 

x = — we deduce 
bi 


Rearranging gives 


“ < 1 + i (^_ n . 

bq/p - p Vbi 


ab < b 


1+p/q 


a Pbi+p/q-p t>i+p/q 


from where we obtain the inequality. □ 


The promised generalisation of the Cauchy-Bunyakovsky-Schwarz Inequality is given in the following 
theorem. 

404 Theorem (Holder Inequality) Given (~x?,T?) 6 (R n ) 2 the following inequality holds 

tf-tl < ll^llpll^Hq. 

Proof: If | |~x? 1 1 = 0 or 1 1 1 = 0 there is nothing to prove, so assume otherwise. From the 

Young Inequality we have 


l*kl lUkl . |x k | p 

I 5 I I I I S I I 


TJkI 


X u 
I lip I I a I I q 


II^IL'p lltll/q 


181 


Adding, we deduce 


ms , iph," 

ipii/p irsii/i 

1 1 
- + - 
p q 

i. 

This gives 

i>«*i<ir2 iwitfii,- 

k=l 

The result follows bg observing that 

n 

y x k Tj k 

k=1 

□ 

As a generalisation of the Triangle Inequality we have 
405 Theorem (Minkowski Inequality) Let p g]1;+oo[. Given [ft , vj ) 6 (K n ) 2 the following inequality holds 

IP+^llp < ITlf + ll'SlIp- 

Proof: From the triangle inequality for real numbers 1.6 

|xk + ykl p = |x k + ykllxk + ykl p_1 < (|x k | + lykl) |x k + ykl p_1 • 

Adding 

n n n 

y |x k + ykl p < Y_ |xkl|x k + ykl p_1 + Y_ lykl|x k + ykl p_1 • (8.24) 

k=1 k=l k=1 

By the Holder Inequality 

Lk=i lxkllxk + ykl p_1 < (Lk=i l x kl p ) 1/p (LLi Ixk + ykl (p ^ 1)q ) 1/q 

= (iLi lxkl p ) 1/p (LLi I** + ykl p ) 1/q ( 8 - 25 ) 

- nu^ir 

In the same manner we deduce 

n 

y lykllx k + ykl p ~ T < ||^y || p ||^ + 1/ ||p /q - (8.26) 

k=1 

Hence (8.24) gives 

k=1 

from where we deduce the result. □ 


< L Ixkykl < || x | 


k=1 


n IXkl lykl ^ 


Homework 
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Problem 8.7.1 Prove Lagrange’s identity: 

Problem 8.7.4 Let "ct £ l n be a fixed vector. Demon- 
strate that 

(Xl<»<n a * b *) 2 = (^1<i<n a f) (^1<j<n b ?) 

X = G R n : = 0} 

_ ^l<fc<J<n (akb J _ ajbk)2 

is a subspace of R n . 

and then deduce the CBS Inequality in R n . 

Problem 8.7.5 Let a| G R n , 1 < i < k (k < n) be k 
non-zero vectors such that aNaj = 0 for i =/ j. Prove that 

Problem 8.7.2 Let th G R n for 1 < i < n be unit vec- 
tors with di = 0 . Prove that H 1<t<j < u dt*dj = — — . 

these k vectors are linearly independent. 

Problem 8.7.6 Let a* > 0, 1 < k < n be arbitrary. 

Problem 8.7.3 Let Uk > 0. Use the CBS Inequality to 
shew that 

Prove that 

( S~ „ \ ^ n(n + l)(2n + l) 

2_ ak < 6 2_ k 2- 
\k=1 / k=1 



Answers and Hints 


1 . 1.2 


x € X \ (X \ A) <=4 x G X A x 0 (X \ A) 
<44 x G X Ax G A 
44 x G X n A. 


1.1.3 


X \ (A U B) 44 x G X A (x 0 (AU B)) 

44 xGXA(x^AAx^B) 

44 (xGXAx^A)A(xGXAx^B) 
44 x G (X\A)Ax G (X \ B) 

44 x G (X \ A) n (X \ B). 


1.1.6 One possible solution is 


AuBuC = Au(B\ A) U (C \ (AuB)). 


1.1.8 We have 
giving 
Similarly, 
gives 


|ci| = |a — b + b| < |a — b| + |b|, 
|a|-|b| < |a — b|. 

|b| = |b — a + a| < |b — a| + |a| = |a — b| + |a|, 
|b|-|a| < |a — b|. 


The stated inequality follows from this. 


1.2.1 a ~ a since £ = 1 G Z, and so the relation is reflexive. The relation is not symmetric. For 2 ~ 1 since y G Z 
but 1 9^ 2 since \ g Z. The relation is transitive. For assume a ~ b and b ~ c. Then there exist (m, n) G Z 2 such 
that |y = m, h = n. This gives 

a a b 


c 


b 


• — = mn G 
c 


and so a ' c. 


1.2.2 Here is one possible example: put a ~ b 4 c ‘~ b a G A Then clearly if a G Z \ {0} we have a ~ a since 
aZ a ha = a + 1 G Z. On the other hand, the relation is not symmetric, since 5 ~ 2 as 52 + 5 = 15 G Z but 2 / 5, as 
l2 . 2 = | 0 Z. It is not transitive either, since r,2 3 5 G Z =4 5 ~ 3 and 3 2 2 3 G Z =4 3 ~ 12 but 5 2 2 5 0 Z and so 
5 *<12. 
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1 . 2.4 [B] [x] = x + jZ. [C] No. 

1 . 3.1 Let cu = —j + Then cu 2 + cu + 1 =0 and cu 3 = 1 . Then 

x = a 3 + b 3 + c 3 — 3abc = (a + b + c)(a + tub + tu 2 c)(a + tu 2 b + ctu), 
y = u 3 + v 3 + w 3 — 3uvw = (u + v + w)(u + tuv + cu 2 w)(u + tu 2 v + cuw). 

Then 

(a + b + c) (u + v + w) = au + av + aw + bu + bv + bw + cu + cv + cw, 
(a + cub + cu 2 c)(u + cuv + cu 2 w) = au + bw + cv 

+tu(av + bu + cw) 

+cu 2 ( aw + bv + cu) , 

and 

(a + cu 2 b + cuc)(u + cu 2 v + cuw) = au + bw + cv 

+tu ( aw + bv + cu) 
+tu 2 (av + bu + cw). 


This proves that 


xy = (au + bw + cv) 3 + (aw + bv + cu) 3 + (av + bu + cw) 3 

— 3(au + bw + cv)(aw + bv + cu)(av + bu + cw), 
which proves that S is closed under multiplication. 

1 . 3.2 We have 

xT (yT z) = xT(y ® a® z) = (x) ® (a) ® (y ® a®z) = xg) a® y ® a® z, 
where we may drop the parentheses since <%) is associative. Similarly 

(xTy)T z=(x®a®y)Tz=(x®a®y)® (a 

By virtue of having proved 


(z) = x<gia<g)y®a<g)z. 


xT(yTz) = (xTy)Tz, 


associativity is established. 

1 . 3.3 We proceed in order. 

O Clearly, if a, b are rational numbers, 

|a| < 1, |b| < 1 


labl < 1 


-1 < ab < 1 


1 + ab ' 0, 


whence the denominator never vanishes and since sums, multiplications and divisions of rational numbers 

are rational, ** — — is also rational. We must prove now that —1 < ^ < 1 for (a, b) G] — 1; 1[ 2 . We have 

1 + ab 1 + ab 


—1 < , Q + ^ <1 <4 — 1 — ab < a + b < 1 + ab 


1 + ab 


<4 — 1— ab — a — b<0<l+ab — a — b 

<4 — (a + 1)(b + 1) < 0 < (a— 1)(b — 1). 


Since ( a, b ) g] — 1; 1 [ 2 , (a + 1)(b + l)>0 and so — (a + 1)(b + 1)<0 giving the sinistral inequality. Similarly 
a — 1 < 0 and b — 1 < 0 give (a — 1 )(b — 1 ) >0, the dextral inequality. Since the steps are reversible, we have 

established that indeed — 1 < „ Q ^ < 1 . 

1 + ab 
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© Since a ® b = „ a — — = — 4 Q = b ® a, commutativity follows trivially. Now 
1 + ab 1 + ba 


a ® (b <g) c) = a ( 


One the other hand. 


whence ® is associative. 
© If a ® e = a then a ^ e 


b + c 
1 + be 

b + c 
1 + be 


1 + a 


b + c 


1 + be, 

a(l + be) H-b + c a + b + c + abc 


1+bc + a(b + c) 1+ab+bc+ca' 


(a® b) <g) c = 


a + b 
1 + ab 

a + b 
1 + ab 


1 


a + b 


1 + ae 


1 + ab, 

(a + b) + c(l + ab) 

1 + ab + (a + b)c 

a + b + c + abc 
1 + ab + be + ca’ 

= a, which gives a + e = a+ea 2 or e(a 2 — 1 ) = 0. Since a/±l,we must have e = 0. 


© If a ® b = 0, then -r——; = 0, which means that b = —a. 
1 + ab 


1.3.4 We proceed in order. 

O Since a ® b = a + b - ab = b + a — ba = b ® a, commutativity follows trivially. Now 


a®(b®c) = a®(b + c — be) 

= a + b + c — be — a(b + c — be) 

= a + b + c— ab — be — ca+ abc. 

One the other hand, 

(a®b)®c = (a + b — ab) ® c 

= a + b — ab + c— (a + b — ab)c 
= a + b + c— ab — be — ca+ abc, 

whence ® is associative. 

© If a ® e = a then a + e — ae = a, which gives e(1 — a) =0. Since a =/ 1 , we must have e = 0. 

© If a ® b = 0, then a + b — ab = 0, which means that b (1 — a) = —a. Since a =/ 1 we find b = — y ^ - — 
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Table A. 1: Addition table for Zi i . Table A.2: Multiplication table Zi ^ . 


1 . 3.5 We have 


xoy = (xoy)o(xoy) 
= [yo(xoy)]ox 
= [(xoy)ox]oy 
= [(yox)ox]oy 
= [(x o x) o y] o y 

= (yoy]o(xox) 


= y o x, 

proving commutativity. 

1 . 4.1 The tables appear In tables A. 1 and A.2. 

1 . 4.2 Observe that 

3x 2 — 5x + T = 0 => 4(3x 2 — 5x + T) =40 => x 2 + 2x + T + 3 = 0 =^> (x + T) 2 =8. 

We need to know whether 8 Is a perfect square modulo 1 1 . Observe that (11 — a) 2 = a, so we just need to check half 
the elements and see that 

T 2 = T; 2 2 = 4; 3 2 = 9; ? 2 = 5; 5 2 = 3, 

whence 8 Is not a perfect square modulo 1 1 and so there are no solutions. 

1 . 4.3 From example 50 

x 2 = 5. 

Now, the squares modulo 11 are 0 2 =0, 1" = 1 , 2 2 = 4, 3 2 = 9, 4 2 = 5, 5 2 = 3. Also, (11 — 4) 2 = 7 2 = 5. Hence the 
solutions are x = 4 or x = 7. 

1 . 4.5 Put f (x) = x 4 + x 3 + x 2 + x + T. Then 


f (0) = 1 = 1 mod 1 1 

f (1 ) =5 = 5 mod 11 

f (2) = 31 = 9 mod 11 

f (3) = 121 = 0 mod 11 

f (4) = 341 = 0 mod 11 

f (5) = 781 = 0 mod 11 

f(6) = 1555 = 4 mod 11 

f (7) = 2801 = 7 mod 11 

f (8) =4681 = 6 mod 11 

f (9) = 7381 = 0 mod 11 

f (10) = 11111 = 1 mod 11 
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1 . 5.1 We have 


1 


V2 + 2V3 + 3 vz 


V 2 + 2 V 3 - 3 V 6 
( V 2 + 2 V 3 ) 2 - (3V6) 2 
V2 + 2V3-3V6 
2 + 12 + 4\/6 -54 
V2 + 2V3-3 V6 
-40 + 4\/6 

(y/2 + 2\/3 - 3 y/6) (-40 - 4y/6) 
40 2 - (4V6) 2 

(\/2 + 2V3 - 3\/6) (-40 - 4 a/6) 
1504 

16v / 2 + 22v / 3-30v / 6-18 
~ 376 


1 . 5.2 Since 


(— a)b 1 + ab 1 = (—a + a)b 1 = Opb 1 = Of, 
we obtain by adding — (ab -1 ) to both sides that 

(-a)b- 1 =-(ab- 1 ). 

Similarly, from 

a(— b 1 ) + ab -1 = a(— b~ T + b _1 ) = aOp = Of, 
we obtain by adding — (ab -1 ) to both sides that 

af-b- 1 ) = — (ab- 1 ). 

1 . 6.1 Assume h(b) = h(a). Then 

h(a) = h(b) =+> a 3 = b 3 

=+> a 3 - b 3 = 0 

=+> (a — b)(a 2 + ab + b 2 ) = 0 


Now, 


a\ 2 3a 2 


+ ab + a 2 = (b + + 


This shews that b 2 + ab + a 2 is positive unless both a and b are zero. Hence a — b = 0 in all cases. We have shewn 
that h(b) = h( a) =+> a = b, and the function is thus injective. 

1 . 6.2 We have 

f(a)=f(b) 


6a 


6b 


2a - 3 2b - 3 
6a(2b - 3) = 6b(2a — 3) 

12ab - 18a = 12ab - 18b 

-18a = -18b 

a = b, 


proving that f is injective. Now, if 
then 

that is 6x = y (2x — 3). Solving for x we find 


f(*) = y, y ¥ 3, 

6x 

= y, 


2x — 3 


3y 


2y — 6' 

Since 2y — 6 7 0, x is a real number, and so f is surjective. On combining the results we deduce that f is bijective. 
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2.1.1 A 


2.1.2 A 


1 1 1 

2 4 8 

3 9 27 

1 2 3 

2 4 6 

3 6 9 


2.1.3 M + N = 


a + 1 

0 

2c 


2a 

-4 a 

2c 

a 

b -2a 

0 

, 2M = 

0 

-2a 

2b 

2a 

0 

-2 


2a + 2b 

0 

-2 


2.1.5 A 


and y = 

4. 


13 

-1 


5 

0 

15 

3 

, B = 

6 

1 


2.1.8 The set of border elements Is the union of two rows and two columns. Thus we may choose at most four 
elements from the border, and at least one from the central 3x3 matrix. The largest element of this 3x3 matrix is 
15, so any allowable choice of does not exceed 15. The choice 25, 15, 18,1 23, 20 shews that the largest minimum is 
indeed 15. 


2 . 2.1 


2 . 2.2 


2 2 
0 -2 



a 

b 

c 


a + b + c 

b + c 

c 

AB = 

c + a 

a + b 

b + c 

, BA = 

a + b + c 

a + b 

b 


a + b + c 

a + b + c 

a + b + c 


a + b + c 

c + a 

a 


2.2.3 


1 2 3 


1 1 1 


14 14 14 

2 3 1 


2 2 2 

= 

11 11 11 

3 1 2 


3 3 3 


11 11 11 


whence a + b + c = 36. 


2.2.4 An easy computation leads to N 2 = 


any power — from the fourth on — is the zero matrix. 


0 

0 

4 

12 


0 

0 

0 

-8 


0 

0 

0 

0 

0 

0 

0 

4 

, N 3 = 

0 

0 

0 

0 

and N 4 = 

0 

0 

0 

0 

0 

0 

0 

0 


0 

0 

0 

0 


0 

0 

0 

0 

0 

0 

0 

0 


0 

0 

0 

0 


0 

0 

0 

0 


. ffence 
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2.2.5 AB = O 4 and BA = 


0 2 0 3 

0 2 0 3 

0 2 0 3 

0 2 0 3 

2.2.6 For the first part, observe that 


m(a)m(b) = 


1 0 

-a 1 

0 0 

1 


a 

~1 



1 

0 

b 


-b 

1 

b 2 




2 


0 

0 

1 


a + b 

-.2 1,2 


, , a~ b 

a — b f — — + ab 


f 0 a + b 


-(a + b) f 


(a + b ) 2 


0 0 1 

= m(a + b) 

For the second part, observe that using the preceding part of the problem, 


m(a)m(— a) = m(a — a)=m( 0 ) = 


i 0 0 

0 2 

-° 1 — 
0 0 f 


= 13 , 


giving the result. 
2.2.7 Observe that 

Similarly, 


A 2 = (AB) (AB) = A(BA)B = A(B )B = (AB)B = AB = A. 


B = (BA) (BA) = B(AB)A = B(A)A = (BA)A = BA = B. 
2.2.8 For this problem you need to recall that if |r| < 1 , then 


This gives 


a + ar + ar 2 + ar 3 


1 + i + 4 T + 4 V 


1 , 1,1 

2 + 2? + 2? 


a 


1 -r 
4 


W 3 ’ 

I 2 


3 ’ 


1 + 1 + JT + JT 


1-i 


= 2 . 


and 
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By looking at a few small cases, it is easy to establish by Induction that for n > 1 


0 

i 

0 


l 

0 

0 

22n-1 


2 2ti 

1 

0 

0 

a 2ti = 

0 

i 

0 

22n-A 



2 2n 

0 

0 

i 


0 

0 

1 

2 2 n— 1 J 



2 2n J 


This gives 


i 3 +a + a 2 +a 3 


2.2.9 Observe that 


1 + 

j + j? + 4V + • • 

1 

2 

+ JT + J 5 " + ' ' ' 

0 


4 

3 

2 

3 

0 

1 

2 

+ JT + J5 - + ' ' ' 

1 + 

T + 4 T + 4 V + ' ' 

0 

= 

2 

3 

4 

3 

0 


0 


0 

1 + I + J2+J3+''' 


0 

0 

2 


r 1 

2 





r 1 

-4 x 


-4 X 


-4 x 


16 -x 2 

0 

X 

1 

1 


X 

1 


X 

1 


0 

16 -x 2 


and so we must have 1 6 — x 2 = — 1 or x = ± vT7. 


2.2.10 Disprove! Take A = 


1 0 


0 

1 


and B = 



1 

O 

O 


0 

0 


. Then AB = B. but BA = O 2 . 


2.2. 1 1 Disprove! Take for example A = 


0 0 


and B = 


1 0 


. Then 


1 

1 


1 

0 


-1 0 


-1 

0 

A 2 - B 2 = 

0 1 


-2 

1 


= (A + B)(A — B) 


2.2.14 


= 6. 


32 

-32 

-32 

32 




2.2.15 A 2 


q 2 1001 3 1002 

21002^1001 Q 


2.2.17 The assertion is clearly true for n = 1 . Assume that is it true for n, that is, assume 

cos(n)a — sin(n)a 


sin n a cos n a 


Answers and Hints 


191 


Then 


i n+1 


= A A" 


cos a — sin a 


sin a cos a 






- 


COS 

(n 

a 

— sin(n)a 


sin 

n) 

a 

cos(n)a 


cos a cos(n)a — sin asin(n)a cos asin(n)a — sin a cos(n)a 
sin a cos(n)a + cos asin(n)a — sin asin(n)a + cos a cos(n)a 


cos(n + 1)a — sin(n + 1)a 

sin(n + 1)a cos(n + 1)a 

and the result follows by Induction. 

2.2.18 Let A = [ay], B = [bp] be checkered n x n matrices. Then A + B = ( atj + bp). If j — i is odd, then 
CL; + bp = 0 + 0 = 0, which shows that A + B is checkered. Furthermore, let AB = [cp] with Ctj = JZk i cpkbkj ■ If i 
is even and j odd, then ciik = 0 for odd k and bkj = 0 for even k. Thus Ctj = 0 for i even and j odd. Similarly, Cy = 0 
for odd i and even j. This proves that AB is checkered. 


2.2.19 Put 


0 1 1 
0 0 1 
0 0 0 


We first notice that 




0 0 1 
0 0 0 


0 0 0 

This means that the sum in the binomial expansion 


, J = 0 3 . 


A n = (Is + J) n =Y_ (£)l n ~ k J k 
is a sum of zero matrices for k > 3. We thus have 


A n = I? 


nir’j+^ir'j 2 


1 0 0 
0 1 0 
0 0 1 



0 

n 

n 


0 

0 

1 

E3 

+ 

0 

0 

n 

+ 

0 

0 

0 


0 

0 

0 


0 

0 

0 


1 n 
0 1 
0 0 


n(n+1 ) 


giving the result, since 


/n\ _ n(n-1 ) 


and n 


n| n+1 ) 
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2.2.20 Argue inductively, 


A 2 B = A(AB) =ab = b 

A 3 B = A(A 2 B) = A(AB) = AB = B 


A m B = AB = B. 


Hence B = A m B = 0 n B = 0 n . 


2.2.22 Put A = 


a 


c 


b 

d 


. Using 2.2.21, deduce by iteration that 


A k = (a Ad)*- 1 A. 


2.2.23 


a b 


be = a 2 


2.2.24 ±I 2 , 


a 


c 


b 

—a 


a 2 = 1 — be 


2.2.25 We complete squares by putting Y = 


a 

c 



I. Then 


a 2 + be b( a + d) 

= Y 2 = X 2 - 2X + I = (X - I) 2 = 

-1 

0 

+ I = 

o 

o 

c(a + d) be + d 2 


6 

3 


6 4 


This entails a = 0, b = 0, cd = 6, d 2 =4. Using X = Y + I, we find that there are two solutions, 


1 0 


1 

o 

3 3 

) 

-3 -1 


2.2.26 The matrix 

1 - 11-1 

- 11-11 

A = 

- 11-11 

1 - 11-1 


clearly satisfies the conditions. 
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2.2.27 Put X = 


a b 
c d 


. Then 


X 2 +X 


1 1 
1 1 


a 2 + be + a = 1 
ab + bd + b = 1 
ca + dc + c = 1 
cb + d 2 + d = 1 


The last equation holds 


4a 4 + 8a 3 


a 2 + be + a = 1 
b(a + d + 1 ) =1 
c(a + d + 1) = 1 
(d — a)(a + d + 1 ) = 0 


3a = 0 a e {— ^,—1,0, 1}. 


d = a y ■ 


c = b = 


1 


2a + 1 


1 


(2a + 1 ) 2 + a = 1 


Thus the set of solutions is 


-1 

-1 


0 1 


1/2 

1/2 


-1 

-1 


1 0 


1/2 

1/2 



-3/2 -1/2 
-1 /2 -3/2 

2.2.29 Observe that A = 2I3 — J, where I3 Is the 3x3 identity matrix and 

1 1 1 
J= 1 1 1 
1 1 1 

Observe that J k = 3 k 1 J for integer k > 1 . Using the binomial theorem we have 

A n = (2I 3 - J) n 

= LLo a)(2i 3 )^(-D k j" 

= 2 n l3 + ljL£_i Q)2"- k (-1) k 3 k 

1 

— 2 n ) 

-1 ) n + 2 n+1 (— 1 ) n — 2 n 
— 1 ) n — 2 n (— 1 ) n + 2 n+1 

— 1 ) n — 2 n (— 1 ) n — 2 n (— 1 ) n + 2 71 

2.3.1 There are infinitely many solutions. Here is one: 


= 2"I 3 + -J((-1) T 


1 ) n - 2 n 

1 ) n - 2 n 
n , ->n+1 



1 

2 

3 


-9 

2 

3 


10 

0 

0 

A = 

2 

3 

1 

= 

2 

3 

1 

+ 

0 

0 

0 


3 

1 

2 


3 

1 

2 


0 

0 

0 


2.3.2 There are infinitely many examples. One could take A = 


1 1 


1 -1 


and B = 


1 1 


-1 1 


Another set is 
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2 

0 


0 

0 

A = 

0 

0 

and B = 

0 

2 


2.3.3 If such matrices existed, then by the first equation 

tr (AC) + tr (DB) = n. 

By the second equation and by Theorem 86, 

0 = tr (CA) + tr (BD) = tr (AC) +tr(DB) = n, 


a contradiction, since n > I . 


2.3.4 Disprove! This is not generally true. Take A = 


1 1 


3 0 


and B = 


1 2 


0 1 


. Clearly A T = A and B T = 


AB = 


but 


(AB) T = 


3 1 

3 2 

3 3 

1 2 


2.3.6 We have 


tr (a 2 j = tr 



( 

a b 


a b 



( 

a 2 + be ab + bd 

\ 

tr 






= tr 






c d 


c d 

) 



ca + cd d 2 + cb 

) 


a 2 + d 2 + 2bc 


and 


( 

tr 

f 

a b 

\\ 

V 


c d 

// 


= (a + d) 2 


Thus 


tr (A 2 j = (tr (A)) 2 <==^ a 2 + d 2 +2bc= (a + d) 2 «=*> be = ad, 


is the condition sought. 


2.3.7 


tr ((A — I4) 2 ) = tr (A 2 — 2A + I 4 ) 

= tr (A 2 ) — 2tr (A) + tr (I4) 
= -4 - 2tr (A) + 4 


= — 2tr (A) , 

and tr (3I 4 ) =12. Hence — 2tr (A) = 12 or tr (A) = —6. 

2.3.8 Disprove! Take A = B = I n and n > 1 . Then tr (AB) = n < n 2 = tr (A) tr (B). 


2.3.9 Disprove! Take A 


i 0 

, B = 

0 1 

, c = 

0 0 

. Then tr (ABC) = 1 but tr (BAC) = 0. 

0 0 


0 0 


1 0 









B. We have 
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2 . 3.10 We have 


(AA t ) t = (A t ) t A t = aa t . 


2 . 3.11 We have 


(AB BA) t = (AB) t - (BA) t = B t A t - A t B t = -BA - A(-B) = AB BA. 


2 . 3.13 Let X = [xtj] and put XX T = [ctj]. Then 


0 — Ca — ^ ^ik / Xtk — 0. 
k =1 


2 . 4.1 Here is one possible approach. If we perform Ci <— > C,s on A we obtain 


1 

0 

1 

0 


0 

0 

1 

0 

0 

1 

0 

1 

so take P = 

0 

1 

0 

0 

1 

1 

-1 

1 


1 

0 

0 

0 

1 

-1 

1 

1 


0 

0 

0 

1 


Now perform 2Ri — > Ri on Ai to obtain 



2 

0 

2 

0 


2 

0 

0 

0 

a 2 = 

0 

1 

0 

1 

so take D = 

0 

1 

0 

0 


1 

1 

-1 

1 


0 

0 

1 

0 


1 

-1 

1 

1 


0 

0 

0 

1 


Finally, perform Ri + 2R4 -> Ri on A2 to obtain 


■ 








“ 

4 

-2 

4 

2 


1 

0 

0 

2 

0 

1 

0 

1 

so take T = 

0 

1 

0 

0 

1 

1 

-1 

1 


0 

0 

1 

0 

1 

-1 

1 

1 


0 

0 

0 

1 
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2.4.2 Here is one possible approach. 


a b c 


g h i 


d e f 


P ; P3<-> Pi 


d e f 


g H i 


a b c 


p' 


h g i 
e d f 


b a c 


T^i-Cz-lC, 


h- g g i 

e - d d f 


D:2p 3 


b — a a c 

h — g g i 

e - d d f 


2b — 2a 2a 2c 


Thus we take 


0 

0 

1 


0 

1 

0 

0 

1 

0 

, p' = 

1 

0 

0 

1 

0 

0 


0 

0 

1 


1 

0 

0 


1 

0 

0 

-1 

1 

0 

, D = 

0 

1 

0 

0 

0 

1 


0 

0 

2 


2.4.3 Let Ei j G M nX n(F). Then 


0 0 
0 0 


a 1t ... 0 

a 2 i : 0 


0 0 . a n -i i • 0 

0 0 : a n i : 0 

where the entries appear on the j-column. Then we see that tr ( AEij ) = aji and similarly, by considering BE,j , we 
see that tr ! BE,j ) = bji. Therefore Vi, j, a^ = bjt, which implies that A = B. 
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2.4.4 LetE s t 6 M nX n(R)- Then 


0 0 


0 


E i) A - Cljl Qj2 ... a jn 


0 0 


0 


where the entries appear on the i-th row. Thus 


0 


0 


0 



0 


0 


0 


which means that Vi, j , cp ; cij k = 0. In particular, af t = 0, which means that Vi, j , a, i = 0, i.e., A = O n 


2.5.1 a = 1, b = -2. 

2.5.2 Claim: A -1 = I n — A + A 2 — A 3 . For observe that 

(I n + A) (I n — A + A 2 — A 3 ) = In — A + A 2 — A 3 — A + A 2 — A 3 + A 4 = I n , 

proving the claim. 

2.5.3 Disprove! It is enough to take A = B = 2I n . Then (A + B) -1 = (4I n ) _1 = ^T n but A -1 + B 1 = ll n + jl n = In- 

2.5.8 We argue by contradiction. If exactly one of the matrices is not invertible, the identities 


shew a contradiction depending on which of the matrices are invertible. If all the matrices are invertible then 

On = O n C 1 B 1 A 1 = (ABC)C 1 B 1 A 1 = I n , 

also gives a contradiction. 

2.5.9 Observe that A, B, AB are invertible. Hence 


by cancelling A on the left and B on the right. One can also argue that A = A 1 , B = B 1 , and so AB = (AB) 1 = 
B 1 A 1 = BA. 

2.5.10 Observe that A = (a — b)I n + bll, where U is the n x n matrix with Ip’s everywhere. Prove that 

A 2 = (2(a — b) +nb)A — ((a — b) 2 +nb(a — b))I n . 

2.5. 1 1 Compute ( A — I u ) (B — I n ) ■ 


A = Ain = (ABC)(BC) 1 = On, 

B = I n Bln = (A) 1 (ABC)C 1 = On, 
C = InC = (AB) 1 (ABC) = On, 


A 2 B 2 = In = (AB) 2 


A ABB = ABAB 


AB = BA, 
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2.5.12 By Theorem 86 we have tr (SAS -1 ) =tr(S _1 SA) =tr(A). 

2.7.2 The rank Is 2. 

2.7.3 If B is invertible, then rank ( AB) = rank (A) = rank (BA). Similarly, if A is invertible rank (AB) = rank (B) = 


rank (BA). Now, take A = 

1 0 

and B = 

0 1 

. Then AB = B, and so rank (AB) = 1 . But BA = 

O 

O 


0 0 


0 0 


0 0 


and so rank (BA) = 0. 
2.7.4 Observe that 


1 

1 

0 

0 


1 

1 

0 

0 

0 

0 

1 

1 

R 3 -2(R 1+ R 2 )->R 3 
- — > 

R 4 — 2 R 1 — ) R 4 

0 

0 

1 

1 

2 

2 

2 

2 


0 

0 

0 

0 

2 

0 

0 

2 


0 

-2 

0 

2 


whence the matrix has three pivots and so rank 3. 

2.7.5 The maximum rank of this matrix could be 2. Hence, for the rank to be 1 , the rows must be proportional, 
which entails 

^ = \ =4 x 2 -2x = 0 =4 x G {0,2}. 

2.7.6 Assume first that the non-zero n x n matrix A over a field F has rank 1 . By permuting the rows of the matrix 
we may assume that every other row is a scalar multiple of the first row, which is non-zero since the rank is 1 . 
Hence A must be of the form 



ai 

a 2 

a n 


ai 

A = 

Ai ai 

Ai a 2 

• Ai &n 

= 

a 2 


A n -1 Cl 1 

A n -lQ2 

• An— 1 &n 


a n 


which means that the claimed factorisation indeed exists. 


Ai 


A n -1 


:= XY, 


Conversely, assume that A can be factored as A = XY, where X G M n xi (I 7 ) and Y G Mi X n(F). Since A is 
non-zero, we must have rank (A) > 1 . Similarly, neither X nor Y could be all zeroes, because otherwise A would be 
zero. This means that rank (X) = 1 = rank (Y). Now, since 

rank (A) < mini rank (X) , rank (Y)) = 1 , 
we deduce that rank (A) < 1 , proving that rank (A) = 1 . 

2.7.7 Effecting R3 Ri — > R3; C1R4 — bR2 — > R4 successively, we obtain 










“ 

1 

a 

1 

b 


1 

a 

1 

b 

a 

1 

b 

1 


a 

1 

b 

1 

1 

b 

1 

a 


0 

b - a 

0 

a-b 

b 

1 

a 

1 


0 

a-b 

a 2 -b 2 

1 

1 

d 
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Performing R 2 — aRi — > R 2 ; R 4 + R 3 — > R 4 we have 


1 a 1 b 

0 1 — a 2 b — a 1— ab 

Ob — a 0 a — b 

0 0 a 2 — b 2 2(a — b) 


Performing (1 — a 2 )R 3 — (b — a)R 2 — > R 3 we have 


1 a 

0 1 - a 2 

0 0 

0 0 

Performing R 3 — R 4 — > R 3 we have 

1 a 

0 1 - a 2 

— ■> 

0 0 

0 0 

Performing 2 aR 4 + (a + b)R 3 — ) R 4 we have 


1 b 

b — a 1 — ab 

-a 2 + 2ab — b 2 2a — 2b — a 3 + ab 2 


a 2 — b 2 2(a — b) 


1 

b - a 
-2a(a-b) 


a 2 — b 2 


b 

1 - ab 


-afa 2 — b 2 ) 


2(a — b) 


1 a 

0 1 - a 2 

0 0 

0 0 


Factorising, this is 


1 


a 


1 

b - a 
-2a(a — b) 
0 

1 


b 

1 - ab 


0 1 - a 2 

0 0 

0 0 


— a(a 2 — b 2 ) 

4a 2 — 4ab — a 4 + a 2 b 2 — ba 3 + ab 3 
b 

b — a 1 — ab 

-2a(a — b) -a(a-b)(a + b) 

0 -a(a + 2 + b)(a-b)(a — 2 + b) 


Thus the rank is4if(a + 2 + b)(a — b ) ( a — 2 + b) 0 . The rank is 3 if a + b = 2 and (a,b) y ( 1 , 1 ) orifa + b 
and (a, b) =/ (—1 , —1 ). The rank is 2 if a = b =/ 1 and a/-1. The rank is 1 if a = b = ±1 . 

2.7.8 rank (A) = 4 if m 3 + m 2 +2^0. and rank (A) = 3 otherwise. 


= -2 


2.7.9 The rank is 4 if a =/ ±b. The rank is 1 is a = ±b =/ 0. The rank is 0 if a = b = 0. 


2.7.10 The rank is 4 if ( a — b) (c — d) y 0. The rank is 2 is a = b, c d or if a ^b, c = d. The rank is 1 if a = b and 

c = d. 
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2.7.11 Observe that rank ( AB C ) < rank(B) < 2. Now, 


1 

1 

2 


l 

l 

2 

-2 

x 1 

R 2 + 2 R -| — > R 2 
R3 — Rl — > R 3 

0 

x + 2 

5 

1 

-2 

1 


0 

-3 

-1 


has rank at least 2, since the first and third rows are not proportional. This means that it must have rank exactly 
two, and the last two rows must be proportional. Hence 


x + 2 
-3 



13. 


2.7.13 For the counterexample consider A = 


1 i 
i 1 


2.8. 1 We form the augmented matrix 


p 





-I 

1 

2 

3 

1 

0 

0 

2 

3 

1 

0 

1 

0 

3 

1 

2 

0 

0 

1 


From R2 — 2Ri — > R2 and R3 — 3Ri — > R3 we obtain 


From R 2 <-» R 3 we obtain 


Now, from Ri — R 2 — > Ri and R3 








-i 


1 

2 

3 

1 

0 

0 

' — > 

0 

6 

2 

5 

1 

0 


0 

2 

0 

4 

0 

1 










1 

2 

3 

1 

0 

0 


0 

2 

0 

4 

0 

1 


0 

6 

2 

5 

1 

0 


— 3R2 — > R3 , we obtain 






1 0 3 

4 0 6 


0 2 0 

4 0 1 


0 0 2 

0 1 4 


1 0 3 

4 0 6 

- 

0 1 0 

2 0 4 


0 0 1 

0 4 2 





From 4R2 — > R2 and 4R3 — > R3, we obtain 
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Finally, from Ri — 3R3 — > R3 we obtain 








-i 


1 

0 

0 

4 

2 

0 

- — > 

0 

1 

0 

2 

0 

4 


0 

0 

1 

0 

4 

2 


We deduce that 


r 

-1 

r -| 

1 2 3 


4 2 0 




2 3 T 


2 0 4 




3 T 2 


0 4 2 


2.8.2 To find the inverse of B we consider the augmented matrix 


P 





1 

0 

0 

-1 

1 

0 

0 

0 

-1 

a 

0 

1 

0 


a 

b 

0 

0 

1 


Performing Ri <-» R3, - R>, — > R3, in succession, 


p 





1 


a 

b 

0 

0 

1 

0 

-1 

a 

0 

1 

0 

0 

0 

1 

-1 

0 

0 


Performing Ri + C1R2 — > Ri and R2 — aR3 — > R2 in succession, 


p 





1 


0 

b + a 2 

0 

a 

1 

0 

-1 

0 

a 

1 

0 

0 

0 

1 

-1 

0 

0 


Performing Ri — (b + a 2 )R3 — > R3, — Ri — > Ri and — R2 — > R2 in succession, we find 


whence 


p 

-1 

1 0 0 

— b — a 2 —a —1 

0 

0 

-a -1 0 

0 0 1 

0 0 




B 1 


-b - a 2 -a -1 

-a -1 0 


1 


0 0 
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Now, 


BAB 1 


0 0-1 


a b c 


— b — a 2 —a —1 

0 -1 a 


1 0 0 


-a -1 0 

-lab 


0 1 0 


-1 0 0 


-1 a 0 


0 


0 0 
a 1 0 

b 0 1 


0 0 


c 0 0 

= A T , 

which is what we wanted to prove. 

2 . 8.3 First, form the augmented matrix: 


P 





1 

1 

0 

0 

1 

0 

0 

1 

1 

0 

0 

1 

0 

1 

1 

X 

0 

0 

1 


Perform R 2 — Ri — > R 2 and R 3 — Ri — > R 3 : 


p 




1 

1 

0 

0 

1 0 

0 

0 

1 

0 

-1 1 

0 

0 

1 

X 

-1 0 

1 


Performing R 3 — R 2 — > R 3 : 


1 

0 

0 

1 

0 

0 

0 

1 

0 

-1 

1 

0 

0 

0 

X 

0 

-1 

1 


Finally, performing — R 3 — > R 3 : 


p 





1 

1 

0 

0 

1 

0 

0 

0 

1 

0 

-1 

1 

0 

0 

0 

1 

0 

1 

X 

1 

X- 


2 . 8.4 Since MM 1 = I 3 , multiplying the first row of M times the third column of M 1 , and again, the third row of 
M times the third column of M 1 , we gather that 


1 • 0 + 0 • a + 1 • b = 0, 


0-0 + 1 • a + 1 • b = 1 


b = 0, a = 1. 
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1 

0 

0 


1 

2.8.5 It is easy to prove by induction that A n = 

n 

i 

0 

. Row- reducing, (A n ) 1 = 

— n 


n(n + 1 ) 

n 

i 


(n - 1 )n 


L 2 


L 2 


2.8.6 Take, for example, A 


0 

1 


-1 


= —A- 1 . 


0 


2.8.7 Operating formally, and using elementary row operations, we find 


B 1 = 


Thus B is invertible whenever a — 1 ± 
2.8.8 Form the augmented matrix 


p 





1 

a 

2a 

3a 

i 

0 

0 

0 

b 

2b 

0 

i 

0 

0 

0 

c 

0 

0 

i 


Perform — Ri — > Ri , — R2 — > R2, — R3 — > R3, in succession, obtaining 
aba 


i 

2 

3 

1/a 

0 

0 

0 

i 

2 

0 

1/b 

0 

0 

0 

i 

0 

0 

l/c 


Now perform Ri — 2R2 — > Ri and R2 — 2R3 — > R2 in succession, to obtain 


1 

0 

-1 

1/a 

-2/a 

0 

0 

1 

0 

0 

1/b 

—2/ c 

0 

0 

1 

0 

0 

l/c _ 


Finally, perform Ri + R3 — > Ri to obtain 


1 

0 

0 

1/a 

-2/b 

l/c 

0 

1 

0 

0 

1/b 

-2/c 

0 

0 

1 

0 

0 

l/c_ 


a 

2a 

3a 

-1 

1/a 

-2/b 

1/c 

0 

b 

2b 

= 

0 

1/b 

-2/c 

0 

0 

c 


0 

0 

l/c _ 


a z - 1 


a z + 2a-2 


a-2 


a 2 — 5+2a a 2 — 5+2a a 2 — 5+2a 

1 


2 a + 4 

a 2 — 5+2a a 2 — 5+2a 


1 2 — 5 + 2a 


2 a 

a 2 — 5 + 2a 


2a+5 a 

a 2 — 5 +2 a a 2 — 5+2a 


y/l. 


0 0 
1 0 • 
-n 1 


Whence 
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2.8.9 To compute the inverse matrix we proceed formally as follows. The augmented matrix is 


p 





1 

b 

a 

0 

1 

0 

0 

c 

0 

a 

0 

1 

0 

0 

c 

b 

0 

0 

1 


Performing bR 2 - cRi — > R 2 we find 


p 





1 

b 

a 

0 

1 

0 

0 

0 

— ca 

ab 

— c 

b 

0 

0 

c 

b 

0 

0 

1 


Performing C1R3 + R2 — > R3 we obtain 


p 




-1 

b 

a 

0 

1 0 

0 

0 

— ca 

ab 

— c b 

0 

0 

0 

2ab 

— c b 

a 


Performing 2R2 — R3 — > R2 we obtain 


p 




-1 

b 

a 

0 

1 0 

0 

0 

— 2ca 

0 

— c b 

—a 

0 

0 

2ab 

— c b 

a 


Performing 2cRi + R2 — > Ri we obtain 


p 



1 

2b c 

0 

0 

c b —a 

0 

— 2ca 

0 

— c b —a 

0 

0 

2ab 

— c b a 


From here we easily conclude that 


r 



-1 



- 

b 

a 

0 


1 1 

2b 

1 

2c 

a 

2b c 

c 

0 

a 

= 

1 

2a 

b 

2ac 

1 

2c 

0 

c 

b 


__ c 
2b a 

1 

2a 

1 

2b 


as long as abc =/ 0. 

2.8.10 Since AB is invertible, rank (AB) = n. Thus 

n = rank (AB) < rank (A) < n =4 rank (A) = n, 
n = rank (AB) < rank (B) < n rank (B) = n, 


whence A and B are invertible. 
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2 . 8.11 Form the expanded matrix 


1 + a 

1 

1 

1 

0 

0 

1 

1 +b 

1 

0 

1 

0 

1 

1 

1 + c 

0 

0 

1 


Perform bcRi — > Ri , abR3 — > R3, caR2 — > R2- The matrix turns into 


be + abc 

be 

be 

be 

0 

0 

ca 

ca + abc 

ca 

0 

ca 

0 

ab 

ab 

ab + abc 

0 

0 

ab 


Perform Ri + R2 + R3 — > Ri the matrix turns into 


ab + be + ca + abc 

ab + bc + ca+abc 

ab + be + ca + abc 

be 

ca 

ab 

ca 

ca + abc 

ca 

0 

ca 

0 

ab 

ab 

ab + abc 

0 

0 

ab 


Perform , , . — : — j— Ri -> R 1 . The matrix turns into 

ab+bc+ca+abc 1 1 


1 

1 

1 

b c 

ca 

ab 

ab+bc+ca+abc 

ab+bc+ca+abc 

ab+bc+ca+abc 

ca 

ca + abc 

ca 

0 

ca 

0 

ab 

ab 

ab + abc 

0 

0 

ab 


Perform R2 — caRi — > R2 and R3 — abR3 — > R3. We get 


1 1 1 

0 abc 0 
0 0 abc 


be 

ab+bc+ca+abc 

abc 2 

ab+bc+ca+abc 

ab 2 c 

ab+bc+ca+abc 


ca 


ca 

ab+bc+ca+abc 

c 2 a 2 

ab+bc+ca+abc 

a 2 b c 

ab+bc+ca+abc 


Perform A f iC R2 —1 R2 and A ^ c R j — > R3. We obtain 


ab 


ab 

ab+bc+ca+abc 

a 2 b c 

ab+bc+ca+abc 

2 U 2 


a ~ b 

ab+bc+ca+abc 


1 

1 


b c 


ca 


ab 

ab + b c + ca+ abc 


ab + b c + ca+ abc 


ab+bc+ca+abc 

0 

1 

0 

c 

1 

ca 


a 

ab+bc+ca+abc 

b 

b(ab + bc+ca+abc) 


ab+bc+ca+abc 

0 

n 


b 


a 

1 

ab 


ab+bc+ca+abc 


ab+bc+ca+abc 

c 

c( ab+bc+ca+abc) _ 

Ri 

- 

r 2 

- R 3 -» Ri , getting 





1 

0 

0 

a+b + b c 


c 


b 


ab + b c + ca+ abc 


ab+bc+ca+abc 


ab+bc+ca+abc 

0 


0 

c 

1 

ca 


a 


ab+bc+ca+abc 

b 

b( ab+bc+ca+abc) 


ab+bc+ca+abc 

0 

0 


b 


a 

1 

ab 



ab+bc+ca+abc 


ab+bc+ca+abc 

c 

c( ab+bc+ca+abc) _ 
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We conclude that the inverse Is 


b + c + b c 

ab+bc+ca+abc 


c 

ab+bc+ca+abc 


b 

ab+bc+ca+abc 


c 

ab+bc+ca+abc 

b 

ab+bc+ca+abc 


c + a+ ca 

ab + bc+ca+ ab c 

a 

ab + b c + ca + ab c 


a 

ab+bc+ca+abc 

a + b + ab 

ab+bc+ca+abc 


2.8.16 Since rank (A 2 ) < 5, A 2 is not invertible. But then A is not invertible and hence rank (A) < 5. 

2.8.17 Each entry can be chosen in p ways, which means that there are p 2 ways of choosing the two entries of an 
arbitrary row. The first row cannot be the zero row. hence there are p 2 — 1 ways of choosing it. The second row 
cannot be one of the p multiples of the first row, hence there are p 2 — p ways of choosing it. In total, this gives 
(p 2 — 1 )(p 2 — p) invertible matrices in Z p . 

2.8.18 Assume that both A and B are m x n matrices. Let C = [A B] be the m x (2n) obtained by juxtaposing 
A to B. rank (C) is the number of linearly independent columns of C, which is composed of the columns of A and 
B. By column-reducing the first n columns, we find rank (A) linearly independent columns. By column-reducing 
columns n + 1 to 2n, we find rank (B) linearly independent columns. These rank (A) + rank (B) columns are 
distinct, and are a subset of the columns of C. Since C has at most rank (C) linearly independent columns, it 
follows that rank (C) < rank (A) + rank (B). Furthermore, by adding the n + k-column (1 < k < n) of C to the 
k-th column, we see that C is column-equivalent to [A + B B], But clearly 

rank (A + B) < rank ([A + B B]) = rank (C) , 

since [A + B B] is obtained by adding columns to A + B. We deduce 

rank (A + B) < rank ([A + B B]) = rank (C) < rank (A) + rank (B) , 

as was to be shewn. 


2.8.19 Since the first two columns of AB are not proportional, and since the last column is the sum of the first two, 
rank (AB) = 2. Now, 




2 

. -. 


0 -1 -1 


0 -1 -1 

(AB) 2 = 

-1 0 -1 

= 

0 -1 


1 1 2 


1 1 2 


Since BA is a 2 x 2 matrix, rank (BA) < 2. Also, 

2 = rank (AB) = rank ^(AB) 2 ^ = rank (A(BA)B) < rank (BA) , 
whence rank (BA) = 2, which means BA is invertible. Finally, 

(AB) 2 -AB = 0 3 =+> A(BA-I 2 )B=0 3 =+> BA(BA-I 2 )BA = bo 3 a =+> ba-i 2 =o 2 , 


since BA is invertible and we may cancel it. 

3.1.1 The free variables are z and w. We have 

2y + w = 2 =+> 2y = 2 — w =+> y = 1 + w, 


and 

Hence 


x+y+z+w=0 =+> x = — y — z — w = 2y+2z + 2w. 









X 


0 


0 


0 

y 


T 


0 


0 


= 


+ z 


+ w 


Z 


0 


T 


0 

W 


0 


0 


T 


This gives the 9 solutions. 
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3 . 1.2 We have 


T 2 3 


X 


5 

2 3 T 


v 

= 

6 

3 T 2 


z 




Hence 


-i 


X 


T 2 3 


5 


4 2 0 


5 


4 

V 

= 

2 3 T 


6 

= 

2 0 4 


6 

= 

3 

z 


3 T 2 


0 


0 4 2 


0 


3 


3 . 1.3 The augmented matrix of the system Is 










1 

T 

-2 

T 

5 


T 

-2 

T 

5 

2 

2 

0 

7 

R3— 5 Ri — > R 3 

• — > 

R 2 — 2 R 1 — ) R 2 

0 

6 

-2 

-3 

5 

-3 

4 

T 


0 

7 

-T 

2 






T 

-2 

T 

5 





6R3 — 7R2— > R3 

0 

6 

-2 

-3 






0 

0 

8 

7 


Backward substitution yields 


6y = 2z 


8z = 7 =+> 5 • 8z = 5 • 7 =+> z = 35 = 9, 

3 = 2- 9- 3 = 15 = 2 =4 TT - 6y = TT - 2 =4 y = 22 = 9, 
x = 2y-Tz + 5 = 2- 9-1-9 + 5 = 14=1. 


Conclusion : 

x = T, y = 9, z = 9. 

Check: 

T — 2 • 9 + 9 = —8 = 5, 
2-1 + 2- 9 = 20 = 7, 
5-1-3-9+4-9 = 14=1. 


3 . 1.4 We need to solve the system 

a — b + c — d = p(— 1) = —10, 
a = p(0) = —1 , 
a + b + c + d = p(l) =2, 
a + 2b + 4c + 8d = p(2) = 23. 

Using row reduction or otherwise, we find a = — 1 , b = 2, c = —3, d = 4, and so the polynomial is 

p(x) =4x 3 — 3x 2 + 2x — 1 . 


3 . 1.5 Using the encoding chart 


0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

A 

B 

C 

D 

E 

E 

G 

H 

I 

J 

K 

E 

M 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

N 

O 

P 

Q 

R 

S 

T 

U 

V 

W 

X 

Y 

Z 
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we find 


M 


12 


I 


8 


S 


18 


T 


19 


F 


5 

U 

= 

20 

, P3 = 

S 

= 

18 

, P4 = 

E 

= 

4 

, Ps = 

O 

= 

14 

, P 6 = 

A 

= 

0 

N 


13 


T 


19 


A 


0 


F 


5 


L 


11 


Thus 



20 


u 


18 


S 


4 


E 


14 


O 


0 


A 

ap 2 = 

10 

= 

K 

, APs = 

24 

= 

Y 

, ap 4 = 

2 

= 

C 

, APs = 

5 

= 

F 

, AP 6 = 

15 

= 

P 


0 


A 


12 


M 


0 


A 


10 


K 


22 


W 


Finally, the message is encoded into 

OGY UKA SYM ECA OFK APW. 

3 . 1.6 Observe that since 103 is prime, Zi 03 is a field. Adding the first hundred equations, 

1 OOxo + xi + X 2 + • • • + xi 00 = 4950 =4 99xo = 4950 — 4949 = 1 xo = 77 mod 1 03. 

Now, for 1 < k < 100, 

xi< = k — 1 — xo = k — 78 = k + 25. 

This gives 

xi = 26, X 2 = 27, . . . , X 77 = 102, X 78 = 0 ,X 79 = 1 ,X 8 o = 2 X 100 = 22. 

3 . 3.1 Observe that the third row is the sum of the first two rows and the fourth row is twice the third. So we have 


1 

1 

1 

1 

1 

1 


i 

i 

i 

i 

i 

i 

1 

0 

1 

0 

1 

-1 


i 

0 

i 

0 

i 

-i 

2 

1 

2 

1 

2 

0 

R3 — R 1 R 2~ > R 3 
- — > 

R4 — 2 R ] — 2 R 2 — ^ R 4 

0 

0 

0 

0 

0 

0 

4 

2 

4 

2 

4 

0 


0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 


i 

0 

0 

0 

i 

0 







0 

i 

i 

i 

0 

i 







0 

0 

i 

0 

0 

-i 







r 2 -r 5 ->r 2 
R t — R 5 — > R i 

0 

0 

0 

0 

0 

0 







0 

0 

0 

0 

0 

0 







i 

0 

0 

0 

i 

0 


i 

0 

0 

0 

i 

0 

0 

i 

i 

i 

0 

i 

0 

0 

i 

0 

0 

-1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


Rearranging the rows we obtain 
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Hence d and f are free variables. We obtain 

c = -1, 

b = 1 — c — d = 2 — d, 
a = -f. 

The solution is 


a 


0 


0 


-1 

b 


2 


-1 


0 

c 

= 

-1 

+ d 

0 

+ f 

0 

d 


0 


1 


0 

f 


0 


0 


1 


3.3.2 The unique solution is 


1 

1 

1 

1 


3.3.3 The augmented matrix of the system is 


p 



1 

2m 

1 

1 

2 

1 

2m 

1 

4m 

1 

1 

2m 

2m 2 


Performing Ri h R 2 . 


P 



1 

1 

2m 

1 

4m 

2m 

1 

1 

2 

1 

1 

2m 

2m 2 


Performing R2 h R3. 


P 



-1 

1 

2m 

1 

4m 

1 

1 

2m 

2m 2 

2m 

1 

1 

2 


Performing R2 — Ri — > Ri and R3 — 2mRi — > R3 we obtain 


P 



1 

1 

2m 

1 

4m 

0 

1 -2m 

2m- 1 

2m 2 — 4m 

0 

1 -4m 2 

1 -2m 

2 -8m 2 
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If m = j the matrix becomes 


p 



-1 

1 

1 

1 

2 

0 

0 

0 

3 

2 

0 

0 

0 

0 


and hence it does not have a solution. If m =/ j, by performing 1 1 2m R2 — > R2 and 1 _ 1 2m R3 — > R3, the matrix 
becomes 





-1 

1 

2m 

1 

4 m 

0 

1 

-1 

2m(m — 2 ) 
1 —2m 

0 

1 + 2m 

1 

2(1 + 2m) 


Performing R3 — (1 + 2 m) R2 — > R3 we obtain 


p 



-1 

1 

2m 

1 

4 m 

0 

1 

-1 

2m(m— 2) 

1 -2m 

0 

0 

2 + 2m 

2(1+2m)(1-m 2 ) 

1 -2m J 


If m = — 1 then the matrix reduces to 


The solution In this case Is 


If m y — 1 , m y — j we have the solutions 


p 



1 

l 

-2 

l 

-4 

0 

l 

-1 

2 

0 

0 

0 

0 


X 


z 

y 

= 

2 + z 

z 


z 


p -i 


p -1 




X 


1 -2m 

y 

= 

1 —3m 
1 —2m 



( 1 +2m) ( 1 — m) 



1 —2m 


3 . 3.4 By performing the elementary row operations, we obtain the following triangular form: 

ax + y — 2z = 1 , 

(a - 1) 2 y + (1 - a)(a — 2)z = 1 - a, 

(a — 2)z = 0. 

If a = 2 , there is an infinity of solutions: 


X 


1 + 1 

y 

= 

-1 

z 


t 


ten. 
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Assume a =/ 2. Then z = 0 and the system becomes 

ax + y = 1 , 

(a- l) 2 y = 1 - a, 

2x + (3 — a)y = 1 . 

We see that if a = 1 , the system becomes 

x + y = 1, 

2x + 2y = 1 , 

and so there is no solution. If ( a — 1 ) ( a — 2 ) =/ 0 , we obtain the unique solution 


X 


1 

a — 1 

V 

= 

1 


a— 1 

z 


0 



X 


1 

m — 2 

3.3.5 The system is solvable ifm/0,my ±2. II rn =/ 2 there is the solution 

y 

= 

m + 3 
m — 2 


Z 


m + 2 
m — 2 



- 




- “ 

b a 0 


X 


c 

c 0 a 


y 

= 

b 

0 c b 


z 


a 


The system will have the unique solution 


■ ■ 




' “ 

X 


b a 0 


c 

y 

= 

c 0 a 


b 

z 


0 c b 


a 


i 

1 

a 



2b 

2c 

2b c 


L 

1 

b 

1 


b 

2a 

Zac 

2c 



c 

1 

1 


a 

2b a 

2a 

2b 



b 2 + c 2 — a 2 
2b c 

a 2 + c 2 — b 2 
2ac 

a 2 + b 2 — c 2 

- 2ab - 
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as long as the inverse matrix exists, which is as long as abc y 0 
3 . 3.8 We first form the augmented matrix, 


1 - a 

2a + 1 

2a + 2 

a 


1 

3a + 1 

2a + 2 

3a + 2 

a 

a 

0 

2a + 2 

Rl +R 2 ^Rt 

a 

a 

0 

2a + 2 

2 

a + 1 

a- 1 

a 2 — 2a + 9 


2 

a + 1 

a- 1 

a 2 — 2a + 9 


R 2 — oi R i — > R 2 
R 3 — 2 R i — > R ? 


1 3a +1 2a + 2 

0 -3a 2 -2a 2 - 2a 

0 —5a — 1 —3a — 5 


3a + 2 


3 a 2 


a — 8a + 5 


After (—5a — 1 )R 2 + 3a 2 R3 — > R 2 , this las matrix becomes 


1 

3a + 1 

2a + 2 

3a + 2 

0 

0 

a 3 — 3a 2 + 2a 

3a 4 -9a 3 + 18a 2 - 10a -2 

0 

-5a- 1 

-3a -5 

a 2 — 8a + 5 


Exchanging the last two rows and factoring. 


1 

3a + 1 

2a + 2 

3a + 2 

0 

-5a- 1 

-3a -5 

a 2 — 8a + 5 

0 

0 

a(a — 1 )(a — 2) 

(a — 1 )(3a 3 -6a 2 + 12a + 2) 


Thus we must examine a £ {1,2,3} and a 0 (0, 1 , 2}. 

Clearly, ifa(a — 1 ) (a — 2) =/ 0, then there is the unique solution 

y = 


2 + 12a -6a 2 + 3a 3 


2a 3 -3 a 2 + 6a + 10 


a (a — 2) 

If a = 0, the system becomes 

which is inconsistent (no solutions). 
If a = 1 , the system becomes 

which has infinitely many solutions, 


a (a -2) 

x + y + 2z = 0, 0 = 2, 2x + y — z = 0, 

3y + 4z = 1 , x + y = 1 , 2x + 2y = 8, 


2 a 3 — a 2 + 4a + 6 
a (a — 2) 


y = 3 - 3 Z ’ 


2 4 

x = — + — z, 

3 3’ 


If a = 2, the system becomes 

—x + 5y + 6z = 2, 2x + 2y = 6, 2x + 3y + z = 9, 
which is also inconsistent, as can be seen by observing that 

(— x + 5y + 6z) — 6(2x + 3y + z) = 2 — 18 =£> — 13x — 13y = — 18, 
which contradicts the equation 2x + 2y =6. 
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x = 2~ 2 3 6 
V = 2- 3 3 12 
z = 2 2 3- 7 . 


3.3. 10 Denote the addition operations applied to the rows by ai , u 2 , (13 , cu and the subtraction operations to the 
columns by bi , t>2, b3, b4. Comparing A and A T we obtain 7 equations in 8 unknowns. By inspecting the diagonal 
entries, and the entries of the first row of A and A T , we deduce the following equations 

ai = bi , 


a 2 = b 2 , 
a 3 = b 3 , 
a4 = b 4 , 
ai — b 2 = 3, 
ai — b3 = 6, 
ai — b4 = 9. 

This is a system of 7 equations in 8 unknowns. We may let a4 = 0 and thus obtain ai = bi = 9, a 2 = b 2 = 6, 
a3 = b3 = 3, a4 = b4 = 0. 


3.3. 1 1 The augmented matrix of this system is 


Permute the rows to obtain 


-y 

i 

0 

0 

1 

0 

i 

-y 

l 

0 

0 

0 

0 

l 

-y 

l 

0 

0 

0 

0 

l 

-y 

i 

0 

i 

0 

0 

l 

-y 

0 

i 

0 

0 

l 

-y 

0 

0 

l 

-y 

l 

0 

0 

0 

0 

l 

-v 

i 

0 

i 

-y 

i 

0 

0 

0 

-v 

l 

0 

0 

i 

0 


R4 we get 


i 

0 

0 

i 

-y 

0 

0 

i 

-y 

i 

0 

0 

0 

0 

i 

-y 

l 

0 

0 

-y 

i 

-1 

y 

0 

0 

i 

0 

y 

i -y 2 

0 


Performing R5 + y R 1 —1 R5 and R4 — Ri 
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Performing R5 — R2 — > R5 and R4 + yR2 — > R4 we get 


1 

0 

0 

i 


0 

0 

1 

-v 

i 

0 

0 

0 

0 

1 

— y 

i 

0 

0 

0 

1 -y 2 

y 1 

y 

0 

0 

0 

y 

y 1 

1 -y 2 

0 


Performing R5 — yR3 — > R5 and R4 + (y 2 — 1 )R3 — > R4 we get 


l 

0 

0 

l 

-y 

0 

0 

l 

-y 

l 

0 

0 

0 

0 

l 

-y 

l 

0 

0 

0 

0 

-y 3 + 2y - 1 

y 2 + y - 1 

0 

0 

0 

0 

y 2 + y 

1 -y-y 2 

0 


Performing R5 + R4 — > R5 we get 


1 0 0 


-y 


0 


0 i -y 


0 


0 


0 0 


-y 


0 • 


000 


— y 3 + 2y — l y 2 +y 


0 


0 0 0 — y 3 + y 2 + 3y — 2 


0 


0 


Upon factoring, the matrix is equivalent to 


l 

0 

0 


l 

-y 

0 

0 

l 

-y 


l 

0 

0 

0 

0 

i 


-y 

i 

0 

0 

0 

0 

-(y 

-i)(y 2 +y-i) 

y 2 + y - 1 

0 

0 

0 

0 

-(y 

-2)(y 2 +y-1) 

0 

0 


Thus (y — 2)(y 2 + y — 1 )%4 = 0. If y = 2 then the system reduces to 


- 





- 

i 

0 

0 

l 

-2 

0 

0 

l 

-2 

l 

0 

0 

0 

0 

l 

-2 

l 

0 

0 

0 

0 

-5 

5 

0 

0 

0 

0 

0 

0 

0 
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In this case X 5 is free and by backwards substitution we obtain 


■ 


' " 

Xi 


t 

X2 


t 

X3 

= 

t 

X 4 


t 

X5 


t 


t G 


If y 2 + y — 1 = 0 then the system reduces to 


1 

0 

0 

1 

~-y 

0 

0 

1 

-y 

1 

0 

0 

0 

0 

1 

-y 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


In this case X 4 , X 5 are free, and 


Xi 


yt — s 

X2 


y 2 s — yt — s 

X3 

= 

ys — t 

X 4 


s 

X5 


t 


(s,t) G 


Since y 2 s — s = (y 2 + y — 1 )s — ys, this last solution can be also written as 




' 

Xl 


yt - s 

X2 


-ys - yt 

X3 

= 

ys — t 

X 4 


s 

X5 


t 


(s,t) G 


Finally, if (y — 2)(y 2 + y — 1 ) =/ 0, then X 4 = 0, and we obtain 


- 


‘ “ 

Xl 


0 

X2 


0 

X3 

= 

0 

X 4 


0 

X5 


0 
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4 . 1.1 No, since 1 = V is not fulfilled. For example 


1 


1 • 1 


i 


= 


y 


1 


0 


i 


4.1.2 We expand (If + If) fit + b ) in two ways, first using 4.7 first and then 4.8, obtaining 

(If + If) Ct£ + b) = (1 f + 1f )~<i + (lF + lF)b = iJ+ii + b + b, 
and then using 4.8 first and then 4.7, obtaining 

(If + 1f)Cc£ + 1>) =1f(^+^) + 1f(^+^) = +1> + li +1> . 

We thus have the equality 

lf+lS+b + b=li + b+li + b. 

Cancelling ti from the left and b from the right, we obtain 

\ — ) — ) \ 

a + b = b + a, 


which is what we wanted to shew. 

4 . 1.3 We must prove that each of the axioms of a vector space are satisfied. Clearly if (x, y , a) g R + x R + X R then 
x © y = xy > 0 and a ® x = x“ > 0, so V is closed under vector addition and scalar multiplication. Commutativity 
and associativity of vector addition are obvious. 


Let A be additive identity. Then we need 

x © A = x 


xA = x 


A = 1. 


Thus the additive identity is 1 . Suppose I is the additive inverse of x. Then 


xffi I = 1 


xl = 1 


1 = 1 . 

X 


Hence the additive inverse of x is 


Now 


a © (x © y) = (xy )“ = x“y“ = x 


cx cx a ot ^ „ . cx r ^ ,, \ 

y = (a © x) 


(a© V), 


and 


(a+ |3) ® x =x“ +p =x“x p = (x“) © (x p ) = (a ® x) © (|3 ® x), 
whence the distributive laws hold. 


Finally, 


and 


a © ( |3 © x) 


and the last two axioms also hold. 


1 ©) X = X 1 = X, 


cp®x)“ = (x p r = x ap 


(<x|3) © x, 


4 . 1.4 C is a vector space over R, the proof is trivial. But R is not a vector space over C, since, for example taking 
i as a scalar (from C) and 1 as a vector (from R) the scalar multiple i • 1 = i 0 R and so there is no closure under 
scalar multiplication. 


4 . 1.5 One example is 


(z 2 r = 


Addition is the natural element-wise addition and scalar multiplication is ordinary element-wise scalar multiplica- 
tion. 
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4 . 1.6 One example is 



Addition is the natural element-wise addition and scalar multiplication is ordinary element-wise scalar multiplica- 
tion. 

4 . 2.1 Take a £ R and 


7 ? = 


Then 


a 



a' 


b 



b' 



G X, a — b — 3d = 

o, y = 


G X, a' 

c 



c' 


d 



d' 



a 


a' 


a + aa' 


b 


b' 


b + ab' 


x + ay = 


+ a 


= 



c 


c' 


c + ac' 


d 


d' 


d+ ad' 


Observe that 

(a + aa') — (b + ab') — 3(d + ad') = (a — b — 3d) + a(a' — b' — 3d') = 0 + a0 = 0, 
meaning that it + aij G X, and so X is a vector subspace of M 4 . 

4 . 2.2 Take 


it = 




' 

ai 


0.2 

2cn — 3bi 


2az — 3b2 

5bi 

= 

5b 2 

ai +2bi 


a 2 + 2b 2 

ai 


a 2 


a G R. 


Put s = ai + ac 12 , t = bi + ab 2 . Then 


it + aV 


ai + aa 2 


s 

2(ai + aa 2 ) — 3(bi + ab 2 ) 


2s -3t 

5 (b i + ab 2 ) 

= 

5t 

(ai + aa 2 ) + 2(bi + ab 2 ) 


s + 2t 

ai + aa 2 


s 


since this last matrix has the basic shape of matrices in X. This shews that X is a vector subspace of R 5 . 
4 . 2.7 We shew that some of the properties in the definition of vector subspace fail to hold in these sets. 
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0 


0 

O Take it = 

1 

, a = 2. Then it £ V but 2x? = 

2 


0 


0 


0 V as O 2 + 2 2 = 4 y 1 . So V is not closed under scalar 


multiplication. 



0 


1 


1 

© Take it = 

1 

= 

0 

. Then it £ W, it £ W but it + it = 

1 


0 


0 


0 


^ Was 1 • 1 = 1 yo. Hence W is not closed 


under vector addition. 

1 1 


© Take T? = 


0 0 

closed under scalar multiplication. 


. Then ~x? £ Z but — x? = 


-1 

1 


1 

-1 

0 

0 


0 

0 


y Z as 1 + (—1 ) 2 = 2 =/ 0. So Z is not 


4.2.8 Assume lli ^ U 2 and U 2 ^ lli . Take T? £ U 2 \ Ui (which is possible because U 2 ^ lli ) and it £ lli \ U 2 
(which is possible because Ui ^ U 2 ). If ti + £ Ui , then — as -i? is also in Ui — the sum of two vectors in Ui 

must also be in Ui giving 

ll + T? — it = 6 Ui, 

a contradiction. Similarly if Tt + T? £ U 2 , then — as —V also in U 2 — the sum of two vectors in U 2 must also be in 
Ui giving 

tt+T? — ti = ti g U 2 , 

another contradiction. Hence either Ui C U 2 or U 2 C Ui (or possibly both). 


4.2.9 Assume contrariwise that V = Ui (J U 2 U ‘ ‘ ‘ U Ux is the shortest such list. Since the Uj are proper subspaces, 
k > 1 . Choose ~x? £ Ui ,~x? U 2 JJ • • • (J Uk and choose Ui . Put L = {tjf + cxx?ja £ F}. Claim: L fj Ui = 0. For if 
it £ L fj Ui then 3ao £ F with u = + ao x? and so Tj* = u — aot* £ Ui , a contradiction. So L and Ui are disjoint. 


We now shew that L has at most one vector in common with Uj , 2 < j < k. For, if there were two elements of F, 
a =/b with + cfx* , l/ + b~x? £ Uj , j > 2 then 

(a — b) x? = (1? + ax?) — (V + bit) £ Uj , 


contrary to the choice of x? . 

Conclusion: since F is infinite, L is infinite. But we have shewn that L can have at most one element in common 
with the Uj . This means that there are not enough Uj to go around to cover the whole of L. So V cannot be a finite 
union of proper subspaces. 

4.2. 10 Take F = Z 2 , V = F x F. Then V has the four elements 









0 


0 


l 


l 

0 


T 


0 


T 


with the following subspaces 



It is easy to verify that these subspaces satisfy the conditions of the problem. 
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4 . 3.1 If 


then 


1 


1 


1 

0 

+ b 

1 

+ c 

1 

0 


0 


1 


a + b + c 


0 

b + c 

= 

0 

c 


0 


This clearly entails that c = b = a = 0, and so the family is free. 


4 . 3.2 Assume 


Then 


1 


1 


1 


1 


0 

1 

+ b 

1 

+ c 

-1 

+ d 

1 

_ 

0 

1 


-1 


1 


0 


0 

1 


-1 


-1 


1 


0 


a + b + c + d = 0, 
a + b — c + d = 0, 
a — b + c = 0, 
a — b — c + d = 0. 


Subtracting the second equation from the first, we deduce 2c = 0, that is, c = 0. Subtracting the third equation from 
the fourth, we deduce —2c + d = 0 or d = 0. From the first and third equations, we then deduce a + b = 0 and 
a — b = 0, which entails a = b = 0. In conclusion, a = b = c = d = 0. 


Now, put 


Then 



1 


i 


1 


1 


1 


1 


l 


-1 


1 


2 

X 

1 

+ y 


+ z 

1 

+ w 

0 


1 


1 




-1 


1 


1 


Solving as before, we find 


x + y+ z + w = l, 
x + y — z + w = 2, 
x — y +z = 1, 
x — y— z + w = l. 


1 


1 


1 


1 


1 

1 

1 

1 

1 

-1 


1 


2 

1 

+ 2 

-1 

~ 2 

1 


0 


1 

1 


-1 


-1 


1 


1 


2 
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4.3.5 We have 


( a ?1 +^2) — (^2+^3) + (^3+^4) — ( a ?4 +^1 ) = 0 , 


a non-trivial linear combination of these vectors equalling the zero-vector. 

4.3.7 Yes. Suppose that a + b y/2 = 0 Is a non- trivial linear combination of 1 and y/2 with rational numbers a and 
b. If one of a, b is different from 0 then so is the other. Hence 

a + bV~2 = 0 =^> y/2. = — — . 

a 

The sinistral side of the equality y/2. = — — is irrational whereas the dextral side is rational, a contradiction. 

4.3.8 No. The representation 2-1-1- (— V2)y/2 = 0 is a non-trivial linear combination of 1 and y/2. 


4.3.9 1 . Assume that 

If qc / 0, then 


a + bV2 + cV 3 = 0 , a, b, c, g Q, a 2 + b 2 + c 2 / 0. 

_ iu2 2 o 2 _ 

by/2 = -a cy/3 O 2b 2 = a 2 + 2acv / 3 + 3c 2 & — -= — = V 3 . 

2ac 


The dextral side of the last implication is irrational, whereas the sinistral side is rational. Thus it must be the 
case that ac = 0. If a = 0, c y 0 then 

bv / 2 + cv / 3 = 0O--= 

c V 2 

and again the dextral side is irrational and the sinistral side is rational. Thus if a = 0 then also c = 0. We can 
similarly prove that c = 0 entails a = 0. Thus we have 


bv / 2 = 0, 


which means that b = 0. Therefore 

a + b\/2 + cV3 = 0, a,b, c, G Q, ba = b = c = 0. 

This proves that {1 , y/2, V3\ are linearly independent over Q. 

2. Rationalising denominators, 


1 - y/2 VJ2-2 


1 + y/2 2y/U 


1 


!_ 

2 


12 


y/2 + -V3T 

-V2+lV3. 


4.3. 10 Assume that 
Then 

Letting x — > + 00 , we obtain c = 0. Thus 
and so 


ae x + be 2x + ce 3 * = 0. 
c = — ae _2x — be _x . 
ae x +be 2x =0, 


b = — ae x . 

Again, letting x — > + 00 , we obtain b = 0. This yields 


ae x = 0. 

Since the exponential function never vanishes, we deduce that a = 0. Thus a = b = c = 0 and the family is linearly 
independent over R. 

4.3.11 This follows at once from the identity 

cos 2x = cos 2 x — sin 2 x, 
cos 2x — cos 2 x + sin 2 x = 0. 


which implies 
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4.4. 1 Given an arbitrary polynomial 

p(x) = a + bx + cx 2 

we must shew that there are real numbers s, t, u, v such that 


dx 


p(x) = s + t(1 +x) +u(1 +x) 2 +v(1 +x) 3 . 

In order to do this we find the Taylor expansion of p around x = — 1 . Letting x = — 1 in this last equality, 

s = p(-l) = a- b + c- d£l. 


Now, 


Letting x = — 1 we find 
Again, 

Letting x = — 1 we find 
Finally, 


p'(x) = b + 2cx + 3dx 2 = t + 2u(l + x) + 3v(1 + x) 2 . 
t = p'(-1) = b - 2c + 3d G R. 
p"(x) = 2c + 6dx = 2u + 6v(1 + x). 
u = p" (— 1 ) = c — 3d G R. 


p'" (x) = 6d = 6v, 

so we let v = d G R. In other words, we have 

p(x) = a + bx + cx 2 + dx 3 = (a — b + c — d) + (b — 2c + 3d)(l + x) + (c — 3d)(1 + x) 2 + d(1 + x) 3 . 
4.4.2 Assume contrariwise that 


1 


i 


0 

1 

= a 

0 

+ b 

1 

-1 


-1 


-1 


Then we must have 


a = l, 


b = 1, 


—a — b = — 1 , 



1 


1 


0 


/ 

1 


0 

\ 

which is impossible. Thus 

1 

is not a linear combination of 

0 

) 

1 

and hence is not in span 


0 

> 

1 



-1 


-1 


-1 



-1 


-1 

/ 


4.4.3 It is 



1 0 


O 

o 


0 1 


a c 

a 


+ b 


+ c 


= 



0 0 


0 1 


-1 0 


— c b 


i.e., this family spans the set of all skew-symmetric 2x2 matrices over R. 


a 


i 


0 

2a — 3b 


2 


-3 

5b 

= a 

0 

+ b 

5 

a + 2b 


1 


2 

a 


1 


0 


4.5.1 We have 
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so clearly the family 








1 


0 



2 


-3 



0 

) 

5 



1 


2 



1 


0 



spans the subspace. To shew that this is a linearly independent family, assume that 


1 


0 


0 

2 


-3 


0 

0 

+ b 

5 

= 

0 

1 


2 


0 

1 


0 


0 


Then it follows clearly that a = b = 0, and so this is a linearly independent family. Conclusion: 






\ 


1 


0 



2 


-3 



0 

) 

5 



1 


2 



1 


0 



is a basis for the subspace. 
4 . 5.2 Suppose 


0 = a(v, + v 2 ) +b(v 2 + v 3 ) + c(v 3 + v 4 ) +d(v 4 + v 5 ) +f(v 5 + vi) 

= (a + f)T?i + (a + b)T? 2 + (b + c)a? 3 + (c + d)T? 4 + (d + ffv? 5 . 


Since , y ?2 , . . . , vV} are linearly independent, we have 

a + f = 0, 

a + b = 0 
b + c = 0 
c + d = 0 
d + f = 0. 

Solving we find a = b = c = d = f = 0, which means that the 

ftfl +^2,^2 +^3,^3 +^4,^4 +^5,^5 +^l} 

are linearly independent. Since the dimension of V is 5, and we have 5 linearly independent vectors, they must also 
be a basis for V. 


Answers and Hints 


223 


4.5.3 The matrix of coefficients is already in echelon form. The dimension of the solution space is n — 1 and the 
following vectors in R 2n form a basis for the solution space 


-1 


-1 

1 


0 

0 


1 

0 

, a -2 = 

0 

-1 


-1 

1 


0 

0 


1 

0 


0 


-1 

0 


1 


On 7 


0 


0 

1 


(The “second” —1 occurs on the n-th position. The l’s migrate from the 2nd and n + 1-th position on cm to the 
n — 1 -th and 2n-th position on u rl i .) 


4.5.4 Let A t = —A and B T = B be skew symmetric n x n matrices. Then if A £ R is a scalar, then 

(A + AB) T = -(A + AB), 


so A + AB is also skew- symmetric, proving that V is a subspace. Now consider the set of 

(n-1)n 


1+2 


(n-T 


matrices Ai<, which are 0 everywhere except in the vj-th and ji-spot, where 1 < i < j < n, = 1 = — ciji and i + j = k, 
3 < k < 2n — 1 . (In the case n = 3, they are 


1 

O 

O 


0 0 1 


0 0 0 

-10 0 

> 

0 0 0 

> 

0 0 1 

0 0 0 


-10 0 


0-10 


for example.) It is clear that these matrices form a basis for V and hence V has dimension 


(n-l)n 


2 


4.5.5 Take (tt,T?) £ X 2 and a £ R. Then 


■ ■ 


' 

a 


a' 

b 

, b + 2c = 0, = 

b' 

c 


c' 

d 


d' 


b' +2c' = 0. 


224 


Appendix A 


We have 


a + aa' 


4 ? + ccrf 


b + ab' 
c + ac' 


d + ad' 

and to demonstrate thatll + av G X we need to shew that (b + ab') + 2(c + ac') = 0. But this is easy, as 

(b + ab') + 2(c + ac') = (b + 2c) + a(b' + 2c') = 0 + aO = 0. 


Now 


a 


a 


i 


0 


0 

b 

_ 

-2c 

= a 

0 

+ C 

-2 

+ d 

0 

c 


c 


0 


1 


0 

d 


d 


0 


0 


1 


It is clear that 

1 

0 

0 

0 


0 


0 

-2 


0 

1 

> 

0 

0 


1 


are linearly independent and span X. They thus constitute a basis for X. 

4.5.6 As a basis we may take the n ^ n + - - matrices Eij G M n (F) for 1 < i < j < n. 

4.5.7 dimX = 2, as basis one may take {v? , vt). 

4.5.8 dimX = 3, as basis one may take {v?, vt, V 3 }. 

4.5.9 dimX = 3, as basis one may take {v?, vt, V 3 }. 

4.5.10 Let AGE. Observe that 


a b c 


a' b' c' 


a + Aa' b+Ab' c + Ac' 

0 d f 

+ A 

0 d' f ' 

= 

0 d + Ad' f + Af' 

0 0 g 


0 0 g' 


0 0 g+Ag' 


and if a + b + c = 0, a + d + g = 0, a' + b' + c' = 0, 


a' + d' + g' = 0. then 


cl -h Aa T b T Ab T c T Ac = (a + b + c) -T A(u +b + c ) = 0 + A0 = 0, 


and 

a + Aa' + d + Ad' + g + Ag' = (a + d+ g)+A(a' + d' + g')=0 + A0 = 0, 
proving that V is a subspace. 

Now, a + b + c = 0 = a + d+ g a = — b — c,g=b + c— d. Thus 


a 

b 

C 


— b — c 

b 

C 


-1 

1 

0 


-1 

0 

1 


0 

0 

0 


0 

0 

0 

0 

d 

f 

= 

0 

d 

f 

= b 

0 

0 

0 

+ C 

0 

0 

0 

+ d 

0 

1 

0 

+ f 

0 

0 

1 

0 

0 

g 


0 

0 

b + c — d 


0 

0 

1 


0 

0 

1 


0 

0 

-1 


0 

0 

0 
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It is clear that these four matrices span V and are linearly independent. Hence, dim V = 4. 
4.6. 1 1 . It is enough to prove that the matrix 


A = 


1111 
11 - 1-1 
1-1 1 -1 
1 - 1-1 1 
is invertible. But an easy computation shews that 

1111 
11 - 1-1 


A 2 = 


1 - 11-1 
1 - 1-1 1 


= 414, 


whence the inverse sought is 


1 

1 

1 

1 


1/4 

1/4 

1/4 

1/4 

1 

1 

-1 

-1 


1/4 

1/4 

-1/4 

-1/4 

1 

-1 

1 

-1 


1/4 

-1/4 

1/4 

-1/4 

1 

-1 

-1 

1 


1/4 

-1/4 

-1/4 

1/4 



2 . 


Since the H k are four linearly independent vectors in R 4 and dimR 4 = 4, they form a basis for R 4 . Now, we 
want to solve 


X 


1 

1 

1 

1 


X 


1 

V 


1 

1 

-1 

-1 


y 


2 

z 


1 

-1 

1 

-1 


z 


1 

w 


1 

-1 

-1 

1 


w 


1 


and so 


X 


1 


1/4 

1/4 

1/4 

1/4 


1 


5/4 

y 


2 


1/4 

1/4 

-1/4 

-1/4 


2 


1/4 


= A 1 


= 







= 


Z 


1 


1/4 

-1/4 

1/4 

-1/4 


1 


-1/4 

W 


1 


1/4 

-1/4 

-1/4 

1/4 


1 


-1/4 














1 


1 


1 


1 


1 

2 

5 

1 

1 

1 

1 

-1 

1 

-1 

1 

4 

1 

+ 4 

-1 

4 

1 

4 

-1 

1 


1 


-1 


-1 


1 


It follows that 
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The coordinates sought are 

5 1 1 1 \ 

4’ 4’~4’~47 ' 


3. Since we have 


1 


1 


1 


1 


1 

2 

5 

1 

1 

-1 

1 

1 

1 

-1 

1 

” 4 

1 

4 

1 

+ 4 

-1 

4 

-1 

1 


1 


-1 


-1 


1 


the coordinates sought are 


5 1 1 1 

4 ’ 4 ’ 4 ’ 4 


4.6.2 [ 1 ] a = 1 , [2] (A(a)r 1 = 


1 


0 0 


1 


a — 1 a — 1 

-1 1 - a -1 a + 1 

1 a 


a — 1 

1 


a 


1 0 
1 


a- 1 
-a 1 


[ 3 ] 


1 


1 


a — 1 
-a - 1 
a 


a - 1 
2 + a 


a — 1 

—a 

a 

a - 

a + 1 


1 


a- 1 
-1 
1 

a^T 

1 


5 . 1.1 Let a G R. Then 


x + aa 


(x + aa) — (y + ab) — (z + ac) 

y + ab 

= 

(x + aa) + (y + ab) + (z + ac) 

z + ac 


z + ac 


x — y — z 


a — b — c 

x + y + z 

+ a 

a + b + c 

z 


c 


X 


a 

y 

+ aL 

b 

z 


c 


proving that L is a linear transformation. 
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5.1.2 


L(H + aH') = -A ’(H + aH'lA 1 

= -A T HA 1 + cx(-A ’H'A 1 ) 

= L(H) + «L(H'), 

proving that L is linear. 

5.1.3 Let S be convex and let b G T(S). We must prove that Va G [0; 1], (1 — a) cl + ab G T(S). But since h, b 
belong to T( S ) , 33? G S , tj* G S with T (7? ) = ci , Tf ) = 1 j . Since S Is convex, (1 - a ) X* + ay* G S . Thus 

T((1 — a)3? + al?) G T(S), 

which means that 

(1-a)T0*) + «xTtf) € T(S), 

that Is, 

(1 — a)~cf + at> G T(S), 

as we wished to show. 


5.2.1 Assume 


G ker (L). Then 


X 


0 

y 

= 

0 

z 


0 


that Is 

x — y - z = 0, 
x + y + z = 0, 
z = 0. 

This implies that x — y = 0 and x + y =0, and so x = y = z = 0. This means that 


ker (L) = < 






0 



0 



1 

O 



and L Is Injective. 


By the Dimension Theorem 244, dimlm (L) = dim V — dim ker (L) =3 — 0 = 3, which means that 

Im (L) = R 3 


and L is surjective. 

5.2.2 

1 . If a is any scalar, 



/ 

X 


x' 

\ 


x + ax' 








X 


x' 

L 


V 

+ a 

V' 


= L 

y + ay' 

— 

(x + ax') + (y + ay') 

_ 

x + y 

+ a 

x' +y' 

= L 

V 

+ aL 

y' 



z 


z' 



z + az' 


(x + ax') - (y + ay') 


x-y 


*'■ y' 


z 


z' 



w 


w' 



w + aw' 








w 


w' 
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whence L is linear. 
2. We have, 













X 







0 


L 

v 

= 

x + y 

= 

0 

=£> x = y,x = — y =£• x = y=0 =^> ker (L) = < 


0 

: z £ K, w G K 


z 


*-y 


0 



z 



w 







w 



Thus dimker (L) = 2. In particular, the transformation is not injective. 

3. From the previous part, dim Im ( L ) =4 — 2 = 2. Since Im (L) C R 2 and dimlm(L) = 2. we must have 
Im (L) = R 2 . In particular, the transformation is surjective. 


5 . 2.3 Assume that 


G ker(T), 


a 


1 


1 


0 

b 

= (a-b) 

0 

+ b 

1 

+ c 

0 

c 


0 


0 


1 


Then 


0 

0 

0 

0 


a 


T 


b 


c 


1 


1 


0 

0 

+ bT 

1 

+ cT 

0 

0 


0 


1 


1 


2 


1 

0 

+ b 

-1 

+ c 

-1 

-1 


0 


1 

0 


0 


0 


a + b + c 
— b — c 
—a + b + c 


0 
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It follows that a = 0 and b = — c. Thus 


and so dim ker (T) = I . 




0 

> 

ker (T) = < 

c 

-1 

: c e R 



1 



By the Dimension Theorem 244, 

dimlm (T) = dim V — dim ker (T) =3 — 1 =2. 


We readily see that 


and so 


5 . 2.4 Assume that 


Then x = — 2y and so 


This means that dim ker (L) 
injective. 


2 


1 


1 

-1 


0 

+ 

-1 

0 


-1 


1 

0 


0 


0 



/[ 

1 

r 


Im (T) = span 


I 

0 

-1 

0 


I 

-1 

I 

0 





x + 2y 


0 


X 





L 

y 


x + 2y 


0 




0 


0 


X 


-2 


= y 


y 


1 


I and ker (L) is the line through the origin and (—2, 1 ). Observe that L is not 


a 


By the Dimension Theorem 244, dimlm (L) = dim V — dim ker (L) =2 


1=1. Assume that 


b 


G Im (L). 


Then 3(x, y ) E R I 2 such that 





x + 2y 


a 

L 

X 

= 

x + 2y 

= 

b 


y 


0 


c 
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This means that 


a 


x + 2y 


1 

b 

= 

x + 2y 

= (x + 2y) 

1 

c 


0 


0 


Observe that L is not surjective. 
5 . 2.5 Assume that 





x-y 


0 


X 





L 

V 


x + y 


0 




0 


0 


Then x+y=0=x— y, that is, x = y = 0, meaning that 




0 

1 

ker (L) = < 






0 

1 


and so L is injective. 


a 


By the Dimension Theorem 244, dimlm (L) = dim V — dimker (L) =2 


0 = 2. Assume that 


b 


G Im (L). 


Then 3(x, y ) G R 2 such that 


This means that 


Since 





x-y 


a 


X 





L 

V 


x + y 


b 




0 


c 


a 


x-y 


l 



b 

= 

x + y 

= X 

l 

+ y 

l 

c 


0 


0 


0 


i 



i 

> 

1 

0 


0 


are linearly independent, they span a subspace of dimension 2 in R 3 , that is, a plane containing the origin. Observe 
that L is not surjective. 

5 . 2.6 Assume that 


x 

y 

z 


X — y — z 

y — 2z 


0 

0 


L 
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Then y = 2z;x = y + z = 3z. This means that ker (L) = < 

injective. 

Now, if 


: z G 


> . Hence dim ker (L) = 1, and so L is not 


X 




- - 



X — y — z 


a 

V 


y — 2z 


b 

z 






Then 


a 

. 

x — y — z 

= X 

1 

+ y 


+ Z 

-1 

b 


y — 2z 


0 


i 


-2 


Now, 


1 


-1 


- 2 


0 


1 


-1 

-2 


and 


1 


are linearly independent. Since dimlm (L) = 2, we have Im (L) = R 2 , and so L is surjective. 
5.2.7 Assume that 


0 = tr 


( 

a b 

\ 

V 

c d 

/ 


= a + d. 


Then a = — d and so. 


a b 


-d b 


-1 

0 


0 

1 


0 

0 


= 


= d 



+ b 



+ c 



c d 


c d 


0 

1 


0 

0 


1 

0 


( 

a 0 

\ 

V 

0 0 

/ 


and so dim ker (L) =3. Thus L is not injective. L is surjective, however. For if a G R, then 

a = tr 

5.2.8 1. Let (A,B) 2 G M 2X 2 (R), a G R. Then 

L(A+aB) = (A + aB) T + (A + aB) 

= A T + B T + A + aB 
= A t T A T aB T + aB 
= L(A) + aL(B), 


proving that L is linear. 
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2. Assume that A = 


a 


c 


b 

d 


g ker (L). Then 


0 0 


a b 


a c 


2a b + c 


= L(A) = 


+ 


= 


0 0 


c d 


b d 


b + c 2d 


whence a = d = 0 and b = — c. Hence 


ker (L) 


= span 


/ 

0 

-i 

\ 

V 

i 

0 

/ 


and so dim ker (L) = 1 . 

3. By the Dimension Theorem, dimlm ( L) =4 — 1 = 3. As above, 


L(A) 


2a b + c 


b + c 2d 


2 0 


0 1 


0 0 


+ (b + c) 


+ d 


0 0 


1 0 


0 2 


from where 


Im (L) = span 


/ 

2 0 


0 1 


0 0 

\ 

V 

0 0 

> 

1 0 

> 

0 2 

) 


5.2.9 O Observe that 

proving the result. 

© The Inverse Is I — jT, for 


(I - T) 2 = I 2T + T 2 = I - 2T + T = I - T, 


(I + T)(I - It) = I + T ^T - It 2 = I + T - It - It = I, 


proving the claim. 
© We have 


1? g ker (T) 


3? — T d? ) g ker (T) 
Id?) — Tfj?) G ker (T) 
(I-T)(l?) g ker(T) 

3? g Im (I — T) . 


5.3. 1 Observe that 


a 


1 


1 


1 


-1 

b 

= d 

1 

+ (2a — c — b) 

0 

+ (— d — 2a + 2c + b) 

1 

+ (—a + c) 

-2 

c 


1 


1 


1 


0 

d 


1 


0 


0 


0 
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Hence 


a 


1 


1 


1 


-1 

b 

= dT 

1 

+ (2a - c-b)T 

0 

+ (— d — 2a + 2c + b)T 

1 

+ (—a + c)T 

-2 

c 


1 


1 


1 


0 

d 


1 


0 


0 


0 


0 


1 


0 


1 

0 

+ (2a — c — b) 

1 

+ (— d — 2a + 2c + b) 

0 

+ (—a + c) 

1 

1 


-1 


-1 


1 


a-b 


a-b 
-a + 2d 


This gives 


1 




0 




0 




0 





1 




-1 




0 




0 

0 

= 

1 

, T 

1 

= 

-1 

, T 

0 

= 

0 

, T 

0 

= 

0 

0 


-1 


0 


0 


1 


0 


0 


2 

0 




0 




0 




1 




The required matrix is therefore 


1-10 0 

1-10 0 

-10 0 2 


This matrix has rank 2, and so dimlm (T) = 2. We can use < 


> as a basis for Im (T). Thus by the 


dimension theorem dimker (T) = 2. If 





L J L J 


_ . 


a 




2d 

0 




a-b 



0 

= T 

b 

- 

a-b 

, Hence the vectors in ker (T) have the form 

2d 



c 




c 

0 




—a + 2d 





d 




d 


2 


0 

2 


0 

0 

> 

1 

1 


0 


and hence we may take < 


> as a basis for ker (T). 
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5.3.2 1. Since the image of T is the plane x + y + z = 0, we must have 


a + 0 + 1 = 0 a = — 1 , 

3 + b — 5 = 0 => b =2, 
— 1 + 2 + c = 0 =^> c = — 1 . 


2. Observe that 


G ker (T) and so 


1 


0 

1 

= 

0 

1 


0 


Thus 



1 


2 


1 


3 

T 

0 

= T 

1 

-T 

1 

= 

2 


0 


1 


1 


-5 



0 


1 


1 


-1 

T 

1 

= T 

2 

-T 

1 

= 

2 


0 


1 


1 


-1 



0 


1 


1 


-1 

T 

0 

= T 

1 

-T 

1 

= 

0 


1 


2 


1 


1 


The required matrix is therefore 


3 -1 -1 

2 2 0 


-5 -1 1 
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5 . 3.3 1 . Let a g R. We have 


( 






( 


\ 

X 

+ a 

u 

= T 

x + au 





V 

V 


V 

) 


\ 

y + av 

/ 


x + cm + y + av 
x + cm — y — av 


2(x + cm) + 3(y + av) 


x + y 


u + V 

x-y 

+ a 

U — V 

2x + 3y 


2u + 3v 



( 

X 



( 

u 

\ 

T 




+ aT 






y 

) 



V 

/ 


proving that T is linear. 
2. We have 


/ 

X 

\ 

V 

y 

/ 


0 


x + y 


0 

0 

<=4> 

x-y 

= 

0 

0 


2x + 3y 


0 


dimker ( T) =0, and whence T is injective. 

3. By the Dimension Theorem, dimlm ( T) =2 — 0 = 2. Now, since 


x = y = 0, 


whence 


4. We have 


and 



( 

X 



X + y 


1 


i 

T 


y 

) 


x-y 

= X 

1 

+ y 





2x + 3y 


2 


3 



/ 




\ 


1 


1 

Im (T) = span 


1 

> 

-1 




2 


3 

/ 


1 



3 

11 

i 

5 

1 

13 

0 


11/2 



= 

-1 

2 

i 

~ 2 

0 

2 

i 

— 

-5/2 

2 














8 


i 


-1 


0 


-13/2 

1 



4 

15 

i 

7 

1 

19 

0 


15/2 



= 

-2 

~ T 

i 

~ 2 

0 

~ T 

1 

= 

-7/2 

3 

) 













11 


i 


-1 


0 


-19/2 


ss 


38 
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The required matrix is 


11/2 

15/2 

-5/2 

-7/2 

-13/2 

-19/2 


- 1 36 


5.3.4 The matrix will be a 2 x 3 matrix. In each case, we find the action of L on the basis elements of R 3 and express 
the result in the given basis for R 3 . 

1 . We have 


/ 


\ 




/ 


\ 




/ 


\ 




1 





0 





0 


- 





1 






2 






0 


0 


— 

3 

X 


1 


— 

0 

X 


0 


— 

-1 

V 

0 

) 





0 

) 





1 

) 




The required matrix is 

1 2 0 


2. We have 


The required matrix is 


3 0 


/ 

i 

\ 




( 

1 

\ 





1 






0 


' 

3 



1 


V 

0 






0 

/ 


-1 




( 

1 




3 

X 


1 


_ 

3 

3 






2 




1 





1 3 3 


3. We have 


The required matrix is 


3 3 2 


/ 


L 


V 





1 


1 


i 


-2 

= 


= -2 


+ 3 


= 



3 


0 


i 


3 


v 


\) 


3 


1 


i 


0 


= 0 


+ 3 


= 


3 


0 


i 


3 


3 


1 


1 


1 


= 1 


+ 2 


= 


2 


0 


1 


2 


-2 0 1 


3 3 2 
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5.3.5 Observe that 


6lm(T) = ker (T) and so 


Now 



1 


( 

1 


2 



1 


2 


6 

T 


= T 

3 


— 



= 3T 


-T 


= 



0 



1 


3 

) 


1 


3 


9 


and 




( 




\ 







0 

= T 

2 

- 2 

1 


= T 

2 

2T 

1 

= 

-4 


1 


\ 

3 


1 

) 


3 


1 


-6 


The required matrix is thus 


6 -4 
9 -6 


5.3.6 The matrix will be a 1 x 4 matrix. We have 


tr 


tr 


tr 


tr 


/ 

1 

0 

V 

0 

0 

l 

0 

1 

V 

0 

0 

1 

0 

0 

V 

i 

0 

1 

0 

0 

V 

0 

1 


= 1 , 


= 0, 


= 0 , 


= 1. 


Thus 


Ml = (1 0 0 1). 

5.3.7 First observe that ker (B) C ker (AB) since VX e M qx i ( IR ) , 

BX = 0 =£> (AB)X = A(BX) = 0. 

Now 

dim ker ( B ) = q — dim Im ( B ) 

= q — rank ( B ) 

= q — rank (AB) 

= q — dim Im ( AB ) 


dim ker (AB) . 
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Thus ker (B) = ker (AB) . Similarly, we can demonstrate that ker (ABC) = ker (BC) . Thus 

rank (ABC) = dimlm(ABC) 

= r — dimker (ABC) 

= r — dimker (BC) 

= dimlm(BC) 

= rank (BC) . 

6.2.1 This is clearly (1 2 3 4)(6 8 7) of order lcm(4, 3) =12. 

6.3.1 Multiplying the first column of the given matrix by a, its second column by b, and its third column by c, we 
obtain 


abed = 


ab c ab c ab c 


a 


a b J c 6 

We may factor out ab c from the first row of this last matrix thereby obtaining 


abed = abc det 


1 1 1 


Upon dividing by abc, 

a = det 

6.3.2 Performing Ri + R 2 + R 3 — > Ri we have 

a — b — c 


1 1 1 


a = det 


2b 

2c 


2a 

b — c — a 
2c 


2a 

2b 

c — a — b 


a+b+c a + b + c a+b+c 
det 2b b — c — a 2b 

2c 2c c — a — b 

Factorising (a + b + c) from the first row of this last determinant, we have 

1 1 1 

O = (a + b + c) det 2b b-c-a 2b 

2c 2c c — a — b 
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Performing C 2 — Ci — > C 2 and C 3 — Ci — > C 3 , 


12 = (a + b + c) det 


1 

2 b 

2 c 


0 

b — c — a 
0 


0 
0 

c — a — b 


This last matrix is triangular, hence 

O = (a + b + c)(— b — c — a)(— c — a — b) = (a + b + c) 3 , 


as wanted. 


6.3.3 detAi = det A = —540 by multilinearity. detA 2 = detAi = 540 by alternancy. det A3 = 3 detA 2 = 1620 
by both multilinearity and homogeneity from one column, det A4 = det A3 = 1 620 by multilinearity, and det As = 
2 det A 4 = 3240 by homogeneity from one column. 

6.3.5 From the given data, det B = —2. Hence 

det ABC = (det A) (det B) (det C) = -12, 

det 5AC = 5 3 detAC = (125) (det A) (det C) = 750, 

(det A ) 3 27 


(det A 3 B 3 C 1 


(det B ) 3 (det C) 16 


6.3.6 Pick A G R \ (0, an , CI 22 , . . . , a n nj. Put 


X = 


an — A 0 0 

a 2 i a 2 2 — A 0 


a 3 i 


01 n 1 


a32 a33 — A 


an 2 


an3 


0 

0 

0 

Unn A 


and 


y = 


A ai 2 an 
0 A a23 
0 0 A 


a lT 

a 2T 

a 3T 


Clearly A = X + Y, det X = (an — A) ( 022 — A) • • • ( a n n — A) =/ 0, and det Y = A n =/ 0. This completes the proof. 

6.3.7 No. 

6.4. 1 We have 


4 6 


1 3 


1 3 


+ 5(— 1 ) 2+2 det 


+ 8(— 1 ) 2+3 det 


7 9 


7 9 


4 6 


= -2(36 -42) +5(9 - 21) - 8 ( 6 - 12) = 0. 


det A 
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6 . 4.2 Simply expand along the first row 


a b 


c b 


c a 


b det 


+ c det 


c a 


b a 


b c 


a(a 2 — be) — b(ca — b 2 ) + c(c 2 — ab) = a 3 + b 3 + c 3 — 3abc. 


6 . 4.3 Since the second column has three O’s, it is advantageous to expand along it, and thus we are reduced to 
calculate 


-3f— 1 ) 3+2 det 


1 -1 1 

2 0 1 

1 0 1 


Expanding this last determinant along the second column, the original determinant is thus 


-3(— 1 ) 3+2 (— 4 )(— 1 ) 1+2 det 


2 1 
1 1 


3C— 1 )(—!)(— 1)0) =3. 


6 . 4.5 Expanding along the first column. 



1 

i 

1 

1 





X 

a 

0 

0 




det 









X 

0 

b 

0 





X 

0 

0 

c 





a 

0 

0 


1 

1 

1 

det 

0 

b 

0 

— 

x det 

0 

b 

0 


0 

0 

c 


0 

0 

c 


1 

1 

1 


1 

1 

1 

a 

0 

0 

— x det 

a 

0 

0 

0 

0 

c 


0 

b 

0 



1 

1 

1 


1 

1 

1 

xabc — xbc + x det 

a 

0 

0 

— x det 

a 

0 

0 


0 

0 

C 


0 

b 

0 
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Expanding these last two determinants along the third row, 



1 1 1 


i i i 

0 = abc — xbc + x det 

a 0 0 

— x det 

a a 0 


0 0c 


0 b 0 


r 

r 




1 1 


1 1 

abc — xbc + xc det 

a 0 

+ xb det 

a 0 


= abc — xbc — xca — xab. 


It follows that 
whence 


abc = x(bc + ab + ca), 

1 be + ab + ca 1 1 1 

x abc abc’ 


as wanted. 

6.4.7 Expanding along the first row the determinant equals 



a 

b 

0 


a 

0 

0 




r -| 










a b 


a b 

a det 

0 

0 

b 

+ b det 

0 

a 

b 

= ab det 

1 1 

+ ab det 

1 1 


1 

1 

1 


1 

1 

1 






= 2ab(a — b), 


as wanted. 

6.4.8 Expanding along the first row, the determinant equals 


a 0 b 


0 a b 

0 d 0 

+ b det 

c 0 0 

c 0 d 


0 c d 


Expanding the resulting two determinants along the second row, we obtain 


ad det 


a b 


a b 


+ b(— c) det 


c d 


c d 


ad(ad — be)— bc(ad — be) = (ad — be) 2 , 


as wanted. 

6.4.9 For n = 1 we have det(l ) = 1 = (—1 J 1 +1 . For n = 2 we have 


det 


1 


1 


= -1 = (- 1 ) 


2+1 


1 0 
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Assume that the result is true for n — 1 . Expanding the determinant along the first column 


1 1 1 

1 0 0 


det 


0 


1 


0 0 


0 

1 


0 0 0 


1 1 

0 0 
0 0 
0 0 


1 det 


0 0 
1 0 
0 1 


0 0 
0 0 
0 0 


1 0 


0 0 ••• 1 0 
1 1 ••• 1 1 


1 0 ••• 0 


— 1 det 


0 


1 


0 0 


0 

0 


0 0 ••• 1 0 

= 1 ( 0 ) — ( 1 )(— 1 ) n 
= (- 1 ) n+ \ 


giving the result. 


6.4.10 Perform Ck~ Ci — > Ck for k g [2; n]. Observe that these operations do not affect the value of the determinant. 
Then 


1 n — 1 n — 1 n — 1 ••• n— 1 

n 2 — n 0 0:0 


n 


n 


det A = det 


n 


n 


0 

0 


n 0 0 0 0 0 
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Expand this last determinant along the n-th row, obtaining. 


n — 1 n — 1 n — 1 ••• n— 1 n — 1 

2 — n 0 0:00 


det A 


(— 1 ) 1+Tl n det 


0 

0 


3 -n 0 


0 4 -n 


0 0 
0 0 


0 0 0 


-1 0 


= (— 1 ) 1+Tl n(n — 1 ) (2 — n) (3 — n) 

11 1 ...11 


10 0 : 0 0 


(~2) (—1 ) det 


0 1 0 
0 0 1 


0 0 
0 0 


0 0 0 

11 1 ...11 

10 0 loo 


- ( n ! ) det 


0 1 0 
0 0 1 


0 0 
0 0 


0 


00 0 ... 1 0 

= — (n! ) ( — 1 ) n 

= (— 1 ) n+1 n!, 


upon using the result of problem 6.4.9. 


6 . 4.11 Recall that (") = ( n " k ), 



and 


=°, if n > 0. 


Assume that n is odd. Observe that then there are n + 1 (an even number) of columns and that on the same row, 
( k ) is on a column of opposite parity to that of ( n n k ) . By performing Ci — C 2 + C 3 — C 4 + • • • + C u — C n +i — > Ci , 
the first column becomes all 0’s, whence the determinant if 0 if n is odd. 
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6 . 4.15 I will prove that 


det 


Using permissible row and column operations, 


(b + c) 2 

ab 

ac 

ab 

(a + c) 2 

be 

ac 

be 

(a + b) 


= 2abc(a + b + c) 



(b + c) 2 

ab 

ac 


b 2 + 2bc + c 2 

ab 

ac 

det 

ab 

(a + c) 2 

be 

= det 

ab 

a 2 + 2ca + c 2 

be 


ac 

be 

(a + b) 2 


ac 

be 

a 2 + 2ab + b 2 


det 


b 2 + 2bc + c 2 + ab + ac 


ab 


ab + a 2 + 2ca + c 2 + be a 2 + 2ca + c 2 


ac 


be 


det 


ac + be + a 2 + 2ab + b 2 


(b + c)(a + b + c) 


be 


ab 


a + 2ab + b 


ac 


(a + c)(a + b + c) a 2 + 2ca + c 2 


(a + b)(a + b + c) 


be 


be 


a 2 + 2ab + b 2 


Pulling out a factor, the above equals 


b + c 

ab 

ac 

a + c 

a 2 + 2ca + c 2 

be 

a + b 

be 

a 2 + 2ab + b 


(a + b + c) det 


and performing Ri + R2 + R3 — > Ri , this is 

2a + 2b + 2c ab + a 2 + 2ca + c 2 + be ac + be + a 2 + 2ab + b 2 
a + c a 2 + 2ca + c 2 be 

a + b be a 2 +2ab+b 2 


(a + b + c) det 


Factoring this is 


(a + b + c) det 


2(a + b + c) (a+c)(a + b + c) (a + b)(a + b + c) 
a+c a 2 + 2ca + c 2 be 

a + b be a 2 + 2ab + b 2 


which in turn is 


(a + b + c) 2 det 


2 a+c a+b 

a + c a 2 + 2ca + c 2 be 

a + b be a 2 + 2ab + b 2 
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Performing C 2 — (a + c)Ci — > C 2 and C3 — (a + b)Ci — > C3 we obtain 


a — c 


a — b 


(a + b + c) det 


a + c 


a — ab — ac 


a + b —a — ab — ac 


This last matrix we will expand by the second column, obtaining that the original determinant is thus 


a + b + c) 2 

f 

(a + c) det 

a + c 

—a 2 — ab — ac 

+ (a 2 + ab + ac) det 

2 

-a- b 

\ 



a + b 

0 


a + c 

—a 2 — ab — ac 

) 


This simplifies to 


(a + b + c) 2 ((a + c)(a + b)(a 2 + ab + ac) 
+ (a 2 + ab + ac)(— a 2 — ab — ac + be)) 


a(a + b + c) 3 ((a+c)(a + b) — a 2 — ab — ac + bc) 
2abc(a + b + c) 3 , 


as claimed. 


6.4.16 We have 
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det 


abed 
d a b c 
c d a b 
b c d a 


det 


c + d a+b+c+d a+b+c+d a+b+c+d 


(a + b + c + d) det 


C4 — C 3 + C 2 — C ■] — > C 4 
— > 


(a + b + c + d) 


(a + b + c + d) (a — b + c — d) 


1111 

d a b c 

c d a b 

b c d a 

111 0 

d a b c — b + a — d 

c d a b — a + d — c 

b c d a — d + c — b 

1110 
dab 1 
c d a — 1 

b c d 1 


R 2 ~\~ R 3 — ^ R 2 > R 4 + R 3 — ^ R 4 


(a + b + c + d) (a — b + c — d) 


1 


(a + b + c + d) (a — b + c — d) 


Cl C3 ^ Cl ^ C2 c 3-^ c 2 ( a + b + c + d )( a _ b + c _ d ) 


1 1 

d+c a+d b+a 0 
c d a —1 

b + c c + d a+d 0 

1 1 1 

d+c a+d b+a 
b+c c+d a+d 

0 0 1 
d + c — b — a d — b b + a 
b + c — a — d c — a a + d 


(a + b + c + d) (a — b + c 


d) 


d + c — b 
b + c — a 


a d — b 

d c — a 


(a + b + c + d) (a — b + c — d)(d + c — b — a)(c — a) — (d — b)(b + c — a — d) 

(a + b + c + d)(a — b + c — d) 

( ( c — a)(c — a) + (c — a)(d — b) — (d — b)(c — a) — (d — b)(b — d)) 

(a + b + c + d) (a — b + c — d)f(a — c) 2 + (b — d) 2 ). 
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Since 

(a — c) 2 + (b — d) 2 = (a — c + i(b — d))(a — c — l(b — d)), 
the above determinant is then 


(a + b + c + d)(a — b + c — d)(a + ib — c — id) (a — ib — c + id). 
Generalisations of this determinant are possible using roots of unity. 


7.2.1 We have 


det(AI 2 — A) = det 



1 



whence the eigenvalues are 0 and 2. For A = 0 we have 


(A-1) 2 -1 = A(A — 2), 


0I 2 



-1 


This has row-echelon form 


If 


then a = b. Thus 


1 -1 

0 0 


1 -1 


a 


0 

o 

o 


b 


0 


■ - 


■ ■ 

a 

= a 

i 

b 


i 


and we can take 


1 

1 


as the eigenvector corresponding to A 


0. Similarly, for A = 2, 


which has row- echelon form 


If 


1 


2I 2 — A = 


1 


3 

3 


1 3 

0 0 


■ 





■ ■ 

i 

3 


a 


0 

0 

0 


b 


0 



then a = —3b. Thus 
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and we can take 


7 . 2.5 O We have 


as the eigenvector corresponding to A = 2. 


det(Al 3 — A) = det 


A -2 1 

-2 A - 3 2 

1 2 A 


A — 3 2 


-2 2 


-2 A — 3 


+ 2 det 


+ det 


2 A 


1 A 


1 2 


A det 


= A(A 2 -3A-4) + 2(-2A-2) + (-A-T 
= A(A — 4) (A + 1 ) — 5(A + 1 ) 

= (A 2 -4A-5)(A + 1 ) 


© The eigenvalues are — 1 , — 1 , 5. 
© If A = -1 , 


-Is - A) = 


= (A + 1 ) 2 (A — 5) 


-1 -2 1 
-2 -4 2 

1 2 -1 



a 


0 


b 

= 

0 


c 


0 


<^=^> a = —2b + c 


a 


-2 


1 

b 

= b 

1 

+ c 

0 

c 


0 


1 



-2 


1 

We may take as eigenvectors 

1 

) 

0 


0 


1 


, which are clearly linearly independent. 


If A = 5, 


(51s - A) = 


5 -2 1 


a 


0 

-2 2 2 


b 

= 

0 

1 2 5 


c 


0 


<^=4> a = — c, b = —2c, 



a 


-1 


b 

= c 

-2 


c 


1 
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We may take as eigenvector 


1 

2 

-1 


7.2.6 The characteristic polynomial of A must be A^ — 1 , which means that tr (A) = 0 and det A = —1 . Hence A 
a c 


must be of the form 


b -a 


, with —a — be = — 1 , that is, a + be = 1 . 


7.2.7 We must shew that det ( AI n — A ) = det ( AI n — A ) T . Now, recall that the determinant of a square matrix is the 
same as the determinant of its transpose. Hence 

det(AI„-A) = det ((AI n - A) T ) =det(AI^-A T ) = det(AI n - A T ), 


as we needed to shew. 

7.3.1 Put 


D = 

1 

0 

, ,P- 

1 

0 


0 

-2 


1 

-1 


We find 


P ' = 


1 1 

0 -1 


Since A = PDP 


A 10 = PD 10 P 1 = 


1 0 


1 0 


1 1 


1 -1023 

1 -1 


0 1024 


0 -1 


0 1024 


7.3.2 


1 . A has characteristic polynomial det 

2. (A — 1 ) (A + 1 ) = 0 =4 A 

3. For A = — 1 we have 

9 -4 

20 -9 


A — 9 4 

-20 A + 9 


= (A-9)(A + 9) +80 = A 2 -1 = (A — 1)(A + 1). 


= -1 


10a = 4b =4 a = 


2b 


so we can take 


For A = 1 we have 


so we can take 


as an eigenvector. 

9 -4 

20 -9 

as an eigenvector. 


8a = 4b a > 
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4. We can do this problem in at least three ways. The quickest is perhaps the following. 


Recall that a 2 x 2 matrix has characteristic polynomial A 2 — (tr (A)) A + det A. Since A has eigenvalues — 1 
and 1, A 20 has eigenvalues l 20 = 1 and ' I) 20 = 1, i.e., the sole of A 20 is 1 and so A 20 has characteristic 
polynomial (A — 1 ) 2 = A 2 — 2A + 1 . This means that — tr (A 20 ) = —2 and so tr (A 20 ) = 2. 


The direct way would be to argue that 


A 20 


2 


.20 . 


-1 0 


2 


_ -i 


1 


5 2 


0 


5 2 


i 



and so a + d = 2. One may also use the fact that tr (XY) = tr (YX) and hence 

tr (A 20 ) = tr (PD 20 P 1 ) = tr (PP _1 D 20 ) = tr (d 20 ) = 2. 

7 . 3.3 Put 


-1 

0 

0 


1 

1 

1 

0 

-1 

0 

, X = 

0 

1 

1 

0 

0 

3 


0 

0 

1 


Then we know that A = XDX 1 and so we need to find X 1 . But this is readily obtained by performing Ri — R 2 — > Ri 
and R 2 — R 3 — > R 3 in the augmented matrix 


p 





1 

1 

1 

1 

1 

0 

0 

0 

1 

1 

0 

1 

0 

0 

0 

1 

0 

0 

1 


getting 


x- 1 


1 -1 0 
0 1 - 1 - 


0 0 1 
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Thus 



-10 0 


1 

O 

r— 

1 

1 


0-10 


T 

o 

1 

0 0 3 


1 

o 

o 

1 


A = XDX 1 
1 1 1 
0 1 1 
0 0 1 
-10 4 

0-14 
0 0 3 

7.3.4 The determinant is 1 , A = A -1 , and the characteristic polynomial is (A 2 — 1 ) 2 . 
7.3.6 We find 

A + 7 6 


det(Al 2 — A) = det 


-12 A — 10 


= A — 3A + 2 = (A — 1 )(A — 2). 


A short calculation shews that the eigenvalue A = 2 has eigenvector 


2 

-3 


and that the eigenvalue A = 1 has 


eigenvector 


3 

-4 


. Thus we may form 


D = 


2 0 
0 1 


P = 


2 -3 

3 -4 


P ' = 


4 -3 

3 1 


This gives 


A = PDP 


A n = PD n P 1 


2 -3 


I 

o 

£ 

<N 

i 


4 

-3 


—8 • 2 n + 9 

—6 • 2 n + 6 

3 -4 


0 1 


3 

1 


12 • 2 n - 12 

9 • 2 n - 8 


7.4.1 The eigenvalues of A are 0, 1 , and —2. Those of A 2 are 0, 1 , and 4. Hence, the characteristic polynomial of 
A 2 is A(A - 1 ) (A -4). 


8 . 2.1 V2a 2 - 2a + 1 

8.2.2 ||a7|| = 1 =4, 7(A) 2 + (—A) 2 = \ 


2 _ 1 


2A = 


A = ± 


78 ' 


8.2.3 0 

8.2.4 a = ±1 or a = —8. 

8.2.5 [A] 2(7 +7) — 2-7, [B]7 +7 — [C] —(7 +7 +7) 

8.2.6 [A], 7, [B].7, [C].7, [D].7. [E].27(=2d) 

8.2.7 [F], 7, [G], 7, [H], 27, [I]. 7. 
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8.2.8 Let the skew quadrilateral be ABCD and let P, Q, R, S be the midpoints of [A, B], [B, C], [C, D], [D, A], respec- 
tively. Put 5? = OX, where X g {A, B, C, D, P, Q, R, S}. Using the Section Formula 8.4 we have 


_> X + b b +X 

P= y — , q= — . r 




This gives 


X"X 


2 ' 

X-X 


s = 


X+X 


X = 


X -X 


2 2 

This means that QP = Rl> and so PQRS is a parallelogram since one pair of sides are equal and parallel. 

8 . 2.9 We have 2 bX = ft + eX. By Chasles’ Rule aX = a! + eX, and BE) = b! + eTd. We deduce that 

aX + RE) = aX + eX + Bli + eX = aB + bX. 

But since ABCD is a parallelogram, AE) = lid. Hence 

,^ + BE>=AE> + bX = 2bX. 

8 . 2.10 We have iX = -3lll <=^ iX = -3(lX + All ) = -3lX - 3All. Thus we deduce 

iX + 3lX = — 3All 4lX = — 3All 

4AI = 3 All 
a! = I All. 

Similarly 

jX = -lj^ «=* 3jX = fi 

3jX = — jX — All 
4jX = -All 
Al = lAt 


Thus we take I such that AI = §AB and J such that AJ = 4-AB. 


Now 


and 


MA + 3MB = MI + IA + 3IB 
= 4 MI + I A + 3 IB 

= 4M!, 

3MA I- MB = 3MJ + 3JA + MJ + JB 
= 4JVl| + 3jX + jll 
= 4m). 


8.2.11 Let G, O and P denote vectors from an arbitrary origin to the gallows, oak, and pine, respectively. The 
conditions of the problem define X and X , thought of similarly as vectors from the origin, by X = X I R ( O (3 ) , 
X = X — R (X — (I ) , where R is the 90° rotation to the right, a linear transformation on vectors in the plane; the fact 
that — R is 90° leftward rotation has been used in writing Y. Anyway, then 

X+X X+X R(cl-X) 


2 


2 


2 
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is independent of the position of the gallows. This gives a simple algorithm for treasure-finding: take as the 
(hitherto) arbitrary origin, then the treasure is at — ^ . 


8.3.1 a = 


8.3.3 




4 


-1 


2 


= 2 


~h 3 


5 


1 


1 


= 2r^ + 3s*. 


8.3.4 Since ai = c?«T, ci 2 = , we may write 

-d * 

from where the assertion follows. 

8.3.5 

alt + plf = 0 =)> il.fcui + plf) **“c£.lf 

=)> afcf.ll) = 0 

Since Ti =/ if , we must have I a I =/ 0 and thus a = 0. But if a = 0 then 


aii + pi? = if 


pb = 0 
p = 0, 


since b =/ 0 . 

8.3.6 We must shew that 


(lx + 3y )«(2x — 3y ) = 0. 


But 


(it + 3rf ).(2x? — 3xf ) =4\(t\\ 2 - 9 1[^|| 2 = 4(-|[if|| 2 ) -9||Tf|| 2 = 0. 


8.3.7 We have Vv G M 2 , v •( a — b ) =0. In particular, choosing v = a — b , we gather 

fci — i}).fci —if) = ir^ — ifii 2 = o. 

But the norm of a vector is 0 if and only if the vector is the if vector. Therefore it — 1> =lf, i.e., ct = b . 

8.3.8 We have 

||it±l}|| = (it ±lf)«(it ±lf) 

= lt«lt ± 2lt»l} +lf.l} 

= ||^|| 2 ±2^.^ + ||^|| 2 , 

whence the result follows. 

8.3.9 We have 

ill +3 ?h 2 — iftf — "Vn 2 = (it + it Hit + it) — ni — it Hit — it) 

= it »lt + 21 t .it +V»V - (lt.lt - 21 t »v + it »v ) 

= 4 it »lt , 


giving the result. 
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8.3. 10 By definition 


proj^ 


prqj 


proj 1 • ^ 

ll^ll 2 




11^ 


(t-j) 2 


- H 


Since 0 < 


f^) 2 

m 2 m : 


imi 2 ii*h 

< 1 by the CBS Inequality, the result follows. 


-H, 


8.3.11 Clearly, if ~ci = 5 and A y" 0 then there are no solutions. If both ll = ~0 and A = 0, then the solution set is the 
whole space R 2 . So assume that ~Ci =/ O . By Theorem 365, we may write A? = ll -|- V* with proji = TI 1 1 Tj and _L a. 
Thus there are infinitely many solutions, each of the form 

=dl + T? = + T?, 


Hall 2 


Ha|| 2 


where v g a . 


8.4. 1 Since ll = 


2 

— ) 

i 


is normal to 2x — y = 1 and b = 

-3 


is normal to x — 3y = 1 . the desired angle can be 


obtained by finding the angle between the normal vectors: 

It.b 


( a , b ) = arccos 


5 1 7t 

arccos — = -= = arccos — = = — . 

V5 • \/To V2 4 


8.4.2 2(x - 1 ) + (y + 1 ) = 0 or 2x + y = 1 . 

8.4.3 By Chasles’ Rule AA' = ACi + GA 1 , B B ' = + GB 1 , and CC' = C<3 I GC\ Thus 


0 = AA' + BB' + CC' 

= AG + GA 1 + BG + GB 1 + CG! + GC 1 

= -(GA + GB + G(h + (GA 1 + GB 1 + GC 1 ) 
= GA 1 + GB 1 + GC 1 , 


whence the result. 

8.4.4 We have: 

O The points F, A, D are collinear, and so FA is parallel to FD, meaning that there is k £ 1 \ {0} such that 
FA = kFD. Since the lines (AB) and (DC) are parallel, we obtain through Thales' Theorem that FI = kFj and 
F^ = kFcl This gives 

FA — FI = k(FD — f|) =4 LA = kJD. 

Similarly 

F^ - f! = k(F^ - f|) => F^ = kj6. 

Since I is the midpoint of [A, B], IA + IB = 0, and thus k( JC2 + JT>) = ~0 . Since k y" 0, we have JC -\- JD = d*, 
meaning that J is the midpoint of [C, D]. Therefore the midpoints of [A, B] and [C, D] are aligned with F. 
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© Let J' be the intersection of the lines (El) and (DC). Let us prove that J' = J. 

Since the points E, A, C are collinear, there is l =/ 0 such that EA = l EC. Since the lines (ab) and (DC) are 
parallel, we obtain via Thales’ Theorem that El = 1EJ' and EB = IED. These equalities give 

eX-e! = l(E^-EJ^) => fX = lJ 7 C, 

E^-e! = 1(Et3-EJ^) => i^ = ij 7 d. 

Since I is the midpoint of [A, B], IA + fS = X, and thus l(J'C + J'D) = X. Since l =/0, we deduce J'C + J'D = X, 
that is, J' is the midpoint of [C, D], and so J' = J. 

8.4.5 We have: 

O By Chasles’ Rule 


a 

II 

■M-* 

a 


A^ + Bt = lAX^ - 

and 



a 

ii 

a 


AE> + fX = |AX^ ■ 

Adding, and observing that since ABCD is a parallelogram, All = Cl3, 

AB+Bt + AD + DE = A^ 

4=^ 

bI + de = ac^-ab-ad 



Bt + D^ = Ad3 + D^-Al^ 



bI = -dI 

The last equality shews that the lines (BE) and 

(DF) 

are parallel. 


© Observe that Bj = jBC? = jAD = a{ = —LA . Hence 

i) = fX + Al^ + b| = A^, 

proving that the lines (AB) and (IJ) are parallel. 


Observe that 

ft = fX + Afi = 1dX+1a^ = 1cB+FC? = CJ + E^ = EC? + CJ = FJ, 

2 4 2 

whence IEJE is a parallelogram. 

8.4.6 Since IE = 2 ID and [I, D] is a median of AABD, E is the centre of gravity of AABD. Let M be the midpoint 
of [B, D], and observe that M is the centre of the parallelogram, and so 2AM = aX + AE). Thus 

At = | AM = 1 (2AM) = -i(AB + AD). 

To shew that A, C, E are collinear it is enough to notice that Xt = 2 aC. 


8.4.7 Suppose A, B, C are collinear and that 


[A, B]| 
[B, C]| 


— . Then by the Section Formula 8.4, 
P 


-H ~k & + pit 
A + p ’ 


whence pit — (A + p) b + Ah’ = 0 and clearly p — (A + p) + A = 0. Thus we may take a = p, (3 = A + p, and y = A. 
Conversely, suppose that 


alt + (3 b + yet = 0, a+|3+y = 0 

for some real numbers a, p,y, not all zero. Assume without loss of generality that y / 0. Otherwise we simply 
change the roles of y, and a and (3 Then y = — (a+(3)y0. Hence 

alt + pX = ( a + |3 )lt =>■ -t = + , 

a + (3 

and thus [O, C] divides [A, B] into the ratio — , and therefore, A, B, C are collinear. 

a 
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8.4.8 Put OX = ~x? for X G {A.A'.BjB'.CYC^L.M^N, V). Using problem 8.4.7 we deduce 


T? + all + a' a' =lf, 

1 + a + a' 

= 0, 

(A. 1) 

1? + P^ + P'a 1 = 1?, 

1 + P + P 

= 0, 

(A. 2) 

Y? +y3 + y'a^ =~$, 

1 +y + y' 

= 0. 

(A. 3) 


From A. 2, A. 3, and the Section Formula 8.4 we find 

pt> — yY? P'b^-y'? -x 
P~Y ” P'-Y' ” ’ 

whence (p — y) l = pb— yll. In a similar fashion, we deduce 

(y — a)m = yl? — all, 

( a — p } rt = acl — plj . 

This gives 

(P — y)~f + (y — a)rrt + (a — p)tt , 

(P - Y) + (Y - «) + (a — P) = 0, 

and appealing to problem 8.4.7 once again, we deduce that L, M, N are collinear. 
8.5.1 [A] AJ>, [B] All 


8.5.2 Put 


Then either 


or 


1 


1 


-1 




~V \ r~V -s ~v ~v 


1 

x 

1 

= ( i + j + k)x ( t + j ) = j - r = 

1 

1 


0 


0 


3~d 3 cl 

PIT V2 


3 

V2 

3 

VI 

0 


3 ~d 

PH 


3 

VI 

3 

VI 

0 


will satisfy the requirements. 

8.5.3 The desired area is 


Ip^xP^II = 


0 

1 

-1 


1 

0 

-1 


= V3. 


8.5.4 It is not associative, since ix(ixj) = i x k = — j but ( i x i ) x j = Ox j = 0. 

8.5.5 We have x Y<? = — Y? x X* by letting 1/ = X* in 8. 15. Thus 2 Y? x X* = 0 - ' and hence Y<? x Y? = 0 . 

8.5.6 Ztfxl) 
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8 . 5.7 

7x (7 x b ) = bx(x x7) <t=l> (7»b )7 — (7»7) b = (b »7)7 — (b «7)7 <t=l> 7»7 = b «7 =0. 

The answer is thus {7 : 7 G r 7 x b }. 


8 . 5.8 


7 = 


7*b )7 + 6b + 2a x 7 


«+w 

-4 _ (7»7)7 + 61? + 37x1) 

l* + 3|Pl| 2 

8.5.9 Assume contrariwise that it, b , 7" are three unit vectors in R 3 such that the angle between any two of them 

is > — . Then 7 •!) < — 1 , 7 »7 < — 1 and 7 «7 < 1. Thus 
3 2 2 2 

||7+7+7|| 2 = h7|| 2 + ||7|| 2 + h7|| 2 

-|-2 a'* b + 27*7 + 277 cl 
< 1 + 1 + 1 -1 -1 - 1 

= 0, 

which is impossible, since a norm of vectors is always > 0. 

8.5.10 Take (7,7) £ X 2 and a £ R. Then 

7 x (it + ) = 7 x7 + alt x7 = 7 + a7 = 7 , 

proving that X is a vector subspace of R n . 

8.5.11 Since 7, 7 are linearly independent, none of them is Cl". Assume that there are (a, (3, y) £ R 3 such that 

alt + p7 + y7x7 = 7. (A.4) 

Since 7»(7 X 7 ) = 0, taking the dot product of A.4 with 7 yields a| 1 7 1 1 2 =0, which means that a = 0, since 1 17| | =/ 0. 
Similarly, we take the dot product with b and ^xb obtaining respectively, p = 0 and y = 0. This establishes linear 
independence. 

8.5.12 Since 7 i a x X = 7 , there are no solutions il a«b f' 0. Neither are there solutions if 7 = 0 and t) f' 0. If 
both a = b = 0 , then the solution set is the whole of R 3 . Assume thus that a • b = 0 and that a and b are linearly 
independent. Then 7 , b , 7 X b are linearly independent, and so they constitute a basis for R 3 . Any X* £ R 3 can be 
written in the form 

7 = a7 + p b + y7 x b . 

We then have 

b = 7x x 

= p7 x7 + y7 x (7 x7 ) 

= p7x7 + y((7»7)7 — (7»7)7). 

= p7x7— y(7»77) 

= p7x7-y||7|| 2 7, 

p7 x7 + ( — y 1 17 1 1 2 — 1 ft> = 7, 


from where 


which means that p = 0 and y = 


-, since 7, b ,7 X b are linearly independent. Thus 


7 = a7 


,7x7 


in this last case. 


258 


Appendix A 


8.5.13 Let~x?,l|,~x? , ,T/ / be vectors in R 3 and let a g R be a scalar. Then 


+ a{lt',^')) 


L fx/ + ant ' , t/ + ax| ' ) 

(5/ + oilt ' ) x1< + Ti x (V + ay* ' ) 
it xlc + odt ' xl< + ft x t/ + ft x at / ' 
L(5c,^) + aL(x' ,1)') 


8.6.1 The vectors 


and 


are coplanar. A vector normal to the plane is 


a- (-a) 


2a 

0-1 

= 

-1 

a — 0 


a 

0-(-a) 


a 

1 - 1 

= 

0 

2a -0 


2a 


2a 


a 


-2a 

-1 

X 

0 

= 

—3 a 2 

a 


2a 


a 


The equation of the plane is thus given by 


-2a 


x — a 

-3 a 2 

• 

y-o 

a 


z — a 


that is, 


2ax + 3a 2 y 

8.6.2 The vectorial form of the equation of the line is 


az = a 2 . 


Since the line follows the direction of 


the desired plane is 


i 


1 

0 

+ t 

-2 

1 


-1 


1 


1 

-2 

, this means that 

-2 

-1 


-1 


is normal to the plane, and thus the equation of 


(x-1) -2(y -1) - (z-1) =0. 
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8 . 6.3 Observe that (0, 0, 0) (as 0 = 2(0) = 3(0)) is on the line, and hence on the plane. Thus the vector 


1 - 0 


1 

- 1-0 

= 

-1 

- 1-0 


-1 


lies on the plane. Now, if x = 2y = 3z = t, then x = t, y = t/2, z = t/3. Hence, the vectorial form of the equation of 
the line is 


0 


1 


1 

0 

+ 1 

1/2 

= t 

1/2 

0 


1/3 


1/3 


This means that 


1 

1/2 

1/3 


also lies on the plane, and thus 


1 


1 


1/6 

-1 

X 

1/2 

= 

-4/3 

-1 


1/3 


3/2 


is normal to the plane. The desired equation is thus 


1 

6 % 


3 y + 2 Z 


0 . 


8 . 6.4 Put ax = by = cz = t, so x = t/a; y = t/b; z = t/c. The parametric equation of the line is 


X 


1/a 

y 

= t 

1/b 

z 


_1/c_ 


Thus the vector 


1/a 

1/b 

1/c 


is perpendicular to the plane. Therefore, the equation of the plane is 


1/a 


x- 1 


0 

1/b 

• 

y-i 

= 

0 

1/c 


z — 1 


0 


x y z 1 1 1 

b tT 3 = 

a b cab c 


We may also write this as 


bcx + cay + abz = ab + be + ca. 
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8.6.5 A vector normal to the plane is 


. The line sought has the same direction as this vector, thus the equation 


of the line is 


a 


2 


- ■ 




- 

X 


0 


a 

V 

= 

0 

+ t 

a 2 

z 


i 


a 2 


8 . 6.6 We have 
Hence if z = t, 


y = 1 


1 -V = 1 


y = -2. 


X 


t- 1 


-1 


1 

y 

= 

-2 

= 

-2 

+ t 

0 

Z 


t 


0 


1 


8.6.7 The vector 


2-1 


1 

1 - 0 

= 

1 

1 — (—13 


2 


lies on the plane. The vector 


1 


1 


1 

0 

X 

1 

= 

1 

1 


2 


-1 


is normal to the plane. Hence the equation of the plane is 


1 


x — 1 

1 

• 

y 

-1 


z + 1 


= 0 x + y — z = 2. 


8 . 6.8 We have x~cf = — t + 2) and if x b = 2 k — 3~i . By Theorem 398, we have 
t = — ~ct xt — 1? = — 2]< + 3^ + ~i — 2^ = 4i — 2) 


8.6.9 4x + 6y = 1 


2k. 


8 . 6.10 There are 7 vertices (V 0 = (0,0,0), Vi = (11,0,0), V 2 = (0,9,0), V 3 = (0,0,8), V 4 = (0,3,8), V 5 
V 6 = (4, 7, 0)) and 11 edges (V 0 V, , V 0 V 2 , V 0 V 3 , Vi V 5 , Vi V 6 , V 2 V 4 , V 3 V 4 , V 3 V 5 , V 4 V 5 , and V 4 V 6 ). 


8.7.2 Expand ||21™.i at|| 2 =0. 

8.7.3 Observe that , 1 = n. Then we have 



2 


< 




(9,0,2), 


giving the result. 
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8 . 7.4 Take fti,T?) G X 2 and «£l, Then 

+ on?) + cui»A? = 0 + 0 = 0, 


proving that X Is a vector subspace of IR n . 


8 . 7.5 Assume that 

Ai aj + • • • + Ak cik = 0 . 

Taking the dot product with dj and using the fact that at* a} = 0 for 1 =/ j we obtain 

0 -^*dj = Ajdj«Oj = Aj[|aj|| 2 . 


Since aj =/ 0 =+ ||aj|| 2 =/ 0, we must have Aj = 0. Thus the only linear combination giving the zero vector is the 
trivial linear combination, which proves that the vectors are linearly independent. 


8 . 7.6 This follows at once from the CBS Inequality by putting 


"J 


a 1 
1 


l 

£2 


2 

2 

, ^ = 


a_n_ 


n 

n 




and noticing that 


n 


.2 


n(n + 1 1 (2n + 1 ) 
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